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(57) Abstract 

Methods, recombinant host cells and kits are disclosed for 
the production of members of specific binding pairs (sbp), e.g. an- 
tibodies, using display on the surface of secreted replicable genet- 
ic display packages (rgdps), e.g. filamentous phage. To produce a 
library of great diversity recombination occurs between first and 
second vectors comprising nucleic acid encoding first and second 
polypeptide chains of sbp members respectively, thereby produ- 
cing recombinant vectors each encoding both a first and a second 
polypeptide chain component of an sbp member. The recombina- 
tion may take place in vitro or intracellular^ and may be site-spe- 
cific, e.g. involving use of the loxP sequence and mutants thereof. 
Recombination may take place after prior screening or selecting 
for rgdps displaying sbp members which bind complementary sbp 
member of interest. 
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METHODS FOR PRODUCING MEMBERS OF 
SPECIFIC BINDING PAIRS 

The present invention relates to methods for 
producing members of specific binding pairs (sbp). In 
5 particular, the present invention relates to methods for 
producing members of specific binding pairs involving 
recombination between vectors which comprise nucleic 
acid encoding polypeptide chain components of sbp 
members • 

10 Structurally, the simplest antibody (IgG) 

comprises four polypeptide chains, two heavy (H) chains 
and two light (L) chains inter-connected by disulphide 
bonds (see figure 1). The light chains exist in two 
distinct forms called kappa (K) and lambda (X). Each 

15 chain has a constant region (C) and a variable 

region (V). Each chain is organized into a series of 
domains. The light chains have two domains, 
corresponding to the C region and the other to the V 
region. The heavy chains have four domains, one 

20 corresponding to the V region and three domains (1,2 and 
3) in the C region. The antibody has two arms (each arm 
being a Fab region), each of which has a VL and a VH 
region associated with each other. It is this pair of V 
regions (VL and VH) that differ from one antibody to 

25 another (owing to amino acid sequence variations), and 
which together are responsible for recognising the 
antigen and providing an antigen binding site ( ABS ) . In 
even more detail, each V region is made up from three 
complementarity determining regions (CDR) separated by 

30 four framework regions (FR). The CDR's are the most 

variable part of the variable regions, and they perform 
the critical antigen binding function. The CDR regions 
are derived from many potential germ line sequences via 
a complex process involving recombination, mutation and 

35 selection. 

It has been shown that the function of binding 
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antigens can be performed by fragments of a whole 
antibody. Example binding fragments are (i) the Fab 
fragment consisting of the VL, VH, CL and CHI domains; 
(ii) the Fd fragment consisting of the VH and CHI 
5 domains; (iii) the Fv fragment consisting of the VL and 
VH domains of a single arm of an antibody , (iv) the dAb 
fragment (Ward, E.S. et al.. Nature 341, 544-546 (1989) 
which consists of a VH domain; (v) isolated CDR regions; 
and (vi) F(ab' ) 2 fragments , a bivalent fragment 

10 comprising two Fab fragments linked by a disulphide 
bridge at the hinge region. 

Although the two domains of the Fv fragment are 
coded for by separate genes, it has proved possible to 
make a synthetic linker that enables them to be made as 

15 a single protein chain (known as single chain Fv (scFv); 
Bird, R.E. et al., Science 242 . (1988) Huston, 

J.S. et al., Proc. Natl. Acad. Sci., USA 85, 5879-5883 
(1988)) by recombinant methods. These scFv fragments 
were assembled from genes from monoclonals that had been 

20 previously isolated. 

Bacteriophage have been constructed that express 
and display at their surface a large biologically 
functional binding molecule (eg antibody fragments, and 
enzymes and receptors) and which remain intact and 

25 infectious. This is described in WO 92/01047, the 

disclosure of which is herein incorporated by reference. 
Readers of the present document are urged to consult WO 
92/01047 for detailed explanation of many of the 
procedures used in the experiments described herein. 

30 The applicants have called the structure which comprises 
a virus particle and a binding molecule displayed at the 
viral surface a 'package'. Where the binding molecule 
is an antibody, an antibody derivative or fragment, or a 
domain that is homologous to an Immunoglobulin domain, 

35 the applicants call the package a 'phage antibody 1 
(pAb). However, except where the context demands 
otherwise, where the term phage antibody is used 
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generally, it should also be interpreted as referring to 
any package comprising a virus particle and a 
biologically functional binding molecule displayed at 
the viral surface. 
5 pAbs have a range of applications in selecting 

antibody genes encoding antigen binding activities. For 
example, pAbs could be used for the cloning and rescue 
of hybridomas (Orlandi, R. , et al (1989) PNAS 86 p3833- 
3837), and in the screening of large combinatorial 

10 libraries (such as found in Huse, W.D. et al., 1989, 
Science 246 , 1275-1281). In particular, rounds of 
selection using pAbs may help in rescuing the higher 
affinity antibodies from the latter libraries. It may 
be preferable to screen small libraries derived from 

15 antigen- selected cells (Casali, P., et al*, (1986) 

Science 234 p476-479) to rescue the original VH/VL pairs 
comprising the Fv region of an antibody. The use of 
pAbs may also allow the construction of entirely 
synthetic antibodies. Furthermore, antibodies may be 

20 made which have some synthetic sequences e.g. CDRs, and 
some naturally derived sequences. For example, V-gene 
repertoires could be made in vitro by combining un- 
rearranged V genes, with D and J segments. Libraries of 
pAbs could then be selected by binding to antigen, 

25 hypermutated in vitro in the antigen-binding loops or V 
domain framework regions, and subjected to further 
rounds of selection and mutagenesis. 

The demonstration that a functional antigen- 
binding domain can be displayed on the surface of phage, 

30 has implications beyond the construction of novel 

antibodies. For example, if other protein domains can 
be displayed at the surface of a phage, phage vectors 
could be used to clone and select genes by the binding 
properties of the displayed protein. Furthermore, 

35 variants of proteins, including epitope libraries built 
into the surface of the protein, could be made and 
readily selected for binding activities. In effect, 
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other protein architectures might serve as "nouvelle" 
antibodies . 

The technique provides the possibility of 
building antibodies from first principles, taking 
5 advantage of the structural framework on which the 
antigen binding loops fold. In general , these loops 
have a limited number of conformations which generate a 
variety of binding sites by alternative loop 
combinations and by diverse side chains. Recent 

10 successes in modelling antigen binding sites augurs well 
for de novo design. In any case, a high resolution 
structure of the antigen is needed. However, the 
approach is attractive for making e.g. catalytic 
antibodies, particularly for small substrates. Here 

15 side chains or binding sites for prosthetic groups might 
be introduced, not only to bind selectively to the 
transition state of the substrate, but also to 
participate directly in bond making and breaking. The 
only question is whether the antibody architecture, 

20 specialised for binding , is the best starting point for 
building catalysts. Genuine enzyme architectures, such 
as the triose phosphate isomerase (TIM) barrel, might be 
more suitable. Like antibodies, TIM enzymes also have a 
framework structure (a barrel of S-strands and a- 

25 helices) and loops to bind substrate. Many enzymes with 
a diversity of catalytic properties are based on this 
architecture and the loops might be manipulated 
independently on the frameworks for design of new 
catalytic and binding properties. The phage selection 

30 system as provided by the present disclosure can be used 
to select for antigen binding activities and the CDR 
loops thus selected, used on either an antibody 
framework or a TIM barrel framework. Loops placed on a 
e.g. a TIM barrel framework could, be further modified by 

35 mutagenesis and subjected to further selection. Thus, 
there is no need to select for high affinity binding 
activities in a single step. The strategy of the immune 
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system, in which low affinity evolves to high affinity 
seems more realistic and can be mimicked using this 
invention . 

One class of molecules that could be useful in 
5 this type of application are receptors. For example, a 
specific receptor could be displayed on the surface of 
the phage such that it would bind its ligand. The 
receptor could then be modified by, for example, in 
vitro mutagenesis and variants having higher binding 

10 affinity for the ligand selected. The selection may be 
carried out according to one or more of the formats 
described below. 

Alternatively , the phage -receptor could be used 
as the basis of a rapid screening system for the binding 

15 of ligands, altered ligands, or potential drug 

candidates. The advantages of this system namely of 
simple cloning, convenient expression, standard reagents 
and easy handling makes the drug screening application 
particularly attractive. In the context of this 

20 discussion, receptor means a molecule that binds a 

specific, or group of specific, ligand(s). The natural 
receptor could be expressed on the surface of a 
population of cells, or it could be the extracellular 
domain of such a molecule (whether such a form exists 

25 naturally or not), or a soluble molecule performing a 

natural binding function in the plasma, or within a cell 
or organ. 

Another possibility, is the display of an enzyme 
molecule or active site of an enzyme molecule on the 

30 surface of a phage (see examples 11,12,30,31,32 and 36 

of WO 92/01047). Once the phage enzyme is expressed, it 
can be selected by affinity chromatography, for instance 
on columns derivatized with transition state analogues. 
If an enzyme with a different or modified specificity is 

35 desired, it may be possible to mutate an enzyme 

displayed as a fusion on bacteriophage and then select 
on a column derivatised with an analogue selected to 
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have a higher affinity for an enzyme with the desired 
modified specificity. 

Although throughout this application, the 
applicants discuss the possibility of screening for 
5 higher affinity variants of pAbs, they recognise that in 
some applications, for example low affinity 
chromatography (Ohlson, S. et al Anal. Biochem. 169 , 
p204-208 (1988)), it may be desirable to isolate lower 
affinity variants. 

10 pAbs also allow the selection of antibodies for 

improved stability. It has been noted for many 
antibodies, that yield and stability are improved when 
the antibodies are expressed at 30 °C rather than 37 °C. 
If pAbs are displayed at 37°C f only those which are 

15 stable will be available for affinity selection. When 
antibodies are to be used in vivo for therapeutic or 
diagnostic purposes, increased stability would extend 
the half -life of antibodies in circulation. 

Although stability is important for all 

20 antibodies and antibody domains selected using phage, it 
is particularly important for the selection of Fv 
fragments which are formed by the non-covalent 
association of VH and VL fragments. Fv fragments have a 
tendency to dissociate and have a much reduced half-life 

25 in circulation compared to whole antibodies. Fv 

fragments are displayed on the surface of phage, by the 
association of one chain expressed as a gene III protein 
fusion with the complementary chain expressed as a 
soluble fragment. If pairs of chains have a high 

30 tendency to dissociate, they will be much less likely to 
be selected as pAbs. Therefore, the population will be 
enriched for pairs which do associate stably. Although 
dissociation is less of a problem with Fab fragments, 
selection would also occur for Fab fragments which 

35 associate stably. pAbs allow selection for stability to 
protease attack, only those pAbs that are not cleaved by 
proteases will be capable of binding their ligand and 
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therefore populations of phage will be enriched for 
those displaying stable antibody domains. 

The technique of displaying binding molecules on 
the phage surface can also be used as a primary cloning 
5 system. For example, a cDNA library can be constructed 
and inserted into the bacteriophage and this phage 
library screened for the ability to bind a ligand. The 
ligand/binding molecule combination could include any 
pair of molecules with an ability to specifically bind 

10 to one another e.g. receptor /ligand, enzyme/ substrate 

( or analogue ) , nucleic acid binding protein/nucleic acid 
etc. If one member of the complementary pair is 
available, this may be a preferred way of isolating a 
clone for the other member of the pair . 

15 The first functional antibody molecules to be 

expressed on the surface of filamentous phage were 
single-chain Fv's (scFv), so-called because heavy and 
light chain variable domains, normally on two separate 
proteins, are covalently joined by a flexible linker 

20 peptide. Alternative expression strategies have also 

been successful* Fab molecules can be displayed on phage 
if one of the chains (heavy or light) is fused to g3 
capsid protein and the complementary chain exported to 
the periplasm as a soluble molecule. The two chains can 

25 be encoded on the same or on different replicons; the 

important point is that the two antibody chains in each 
fab molecule assemble post-translationally and the dimer 
is incorporated into the phage particle via linkage of 
one of the chains to g3p. 

30 More recent cloning has been performed with 

'phagemid' vectors which have ca. 100-fold higher 
transformation efficiencies than phage DNA. These are 
plasmids containing the intergenic region from 
filamentous phages which enables single- stranded copies 

35 of the phagemid DNA to be produced, and packaged into 
infectious filamentous particles when cells harbouring 
them are infected with 'helper' phages providing the 
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phage components in -trans. When phagemids contain gill 
fused -bo an antibody gene (eg pHEN-1), the resulting 
fusion protein is displayed on the phagemid particle 
( Hoogenboom , H. R. , A. D . Griffiths , K . S . Johnson , D . 
5 J, Chiswell, P. Hudson and G. Winter. (1991). 

Multi-subunit proteins on the surface of filamentous 
phage: methodologies for displaying antibody (Fab) heavy 
and light chains. Nucleic Acids Res. 19 (15), 
4133-4137 ) . Efficient strategies have been developed for 

10 cloning antibody genes, a factor which becomes most 

important when dealing with large numbers of different 
antibody fragments such as repertoires. 

The cloning vector fd-DOG-1 was used in early 
work with phage antibody repertoires in which scFv 

15 fragments were derived from spleen mRNA of mice 

immunised with the hapten oxazalone (Clackson, T., H. R. 
Hoogenboom,. A. D. Griffiths and G. Winter. (1991). 
Making antibody fragments using phage display libraries. 
Nature. 352 , 624-628.); VH and VL domains were 

20 separately amplified then linked at random via a short 

DNA fragment encoding the scFv linker peptide to produce 
a library of approxiamtely 10 5 different clones. This was 
panned against the immunising antigen to select 
combinations of VH and VL which produced functional 

25 antibodies. Several binders were isolated, one in 

particular having an affinity not far below that of the 
best monoclonal antibodies produced by conventional 
hybridoma technology. 

In a mouse, at any one time there are 

30 approximately 10 7 possible H chains and 10 5 possible L 
chains, making a total of 101 2 possible VH:VL 
combinations when the two chains are combined at random 
(these figures are estimates and simply provide a rough 
guide to repertoire size). By these figures, the above 

35 mouse library sampled only 1 in 10 7 of the possible VH: VL 
combinations. It is likely that good affinity antibodies 
were isolated in the work described in the preceeding 
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paragraph because the spleen cells derived from an 
immunised donor in which B cells capable of recognising 
the antigen are clonally expanded and producing large 
quantities of Ig mRNA. The low library complexity in 
5 this experiment is partly due to the intrinsically low 
transformation efficiency of phage DNA compared to 
plasmid ( or phagemid ) . 

Marks et al. (Marks, J.D., Hoogenboom, H. R. , 
Bonnert, T.P., McCafferty, J., Griffiths, A.D. and 

10 Winter, G. (1991) By-passing immunization: Human 

antibodies from V-gene libraries displayed on phage. J. 
Mol. Biol. 222, 581-597) and W092/01047 describe 
construction of an antibody repertoire from unimmunised 
humans cloned in the phagemid pHEN-1. This library, 

15 consisting of 3.10 7 clones has so far yielded specific 

antibodies to many different antigens. These antibodies 
tend to have the moderate affinities expected of a 
primary immune response, demonstrating that usable 
antibodies to a range of structurally diverse antigens 

20 can indeed be isolated from a single resource. 

New binders can be created from clones isolated 
from phage antibody libraries using a procedure called 
1 chain- shuffling 1 . In this process one of the two chains 
is fixed and the other varied. For example, by fixing 

25 the heavy chain from the highest affinity mouse anti-OX 
phage antibody and recloning the repertoire of light 
chains alongside it, libraries of 4.10 7 were constructed. 
Several new OX-binders were isolated, and the majority 
of these had light chains that were distinct from those 

30 first isolated and considerably more diverse. These 
observations reflect the fact that a small library is 
sufficient to tap the available diversity when only one 
chain is varied, a useful procedure if the original 
library was not sufficiently large to contain the 

35 available diversity. 

The size of the library is of critical 
importance. This is especially true when attempting to 
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Isolate antibodies from a naive human repertoire, but is 
equally relevant to isolation of the highest affinity 
antibodies from an immunised source. 

It is clear that while phage display is an 
5 exceptionally powerful tool for cloning and selecting 

antibody genes, we are tapping only the tiniest fraction 
of the potential diversity using existing technology. 
Transformation efficiencies place the greatest 
limitation on library size with 10 9 being about the limit 
10 using current methods. Rough calculations suggest that 
this is several orders of magnitude below the target 
efficiency; more rigourous analysis confirms it. 

Per el son and Oster have given theoretical 
consideration to the relationship between size of the 
15 immune repertoire and the likelihood of generating an 
antibody capable recognising a given epitope with 
greater than a certain threshold affinity, K. The 
relationship is described by the equation: 

P- e-*(*[ K ] ) 

20 Where P = probability that an epitope is not 

recognised with an affinity above the threshold value K 

by any antibody in the repertoire, 

N = number of different antibodies in the 

repertoire, and 
25 p[K]= probability that an individual antibody 

recognises a random epitope with an affinity above the 

threshold value K 

Xn this analysis p[K] is inversely proportional 
30 to affinity, although an algorithm describing this 

relationship precisely has not been deduced. Despite 
this, it is apparent that the higher the affinity of the 
antibody, the lower its p[K] and the larger the 
repertoire needs to be to achieve a reasonable 
35 probability of isolating that antibody. The other 

important feature is that the function is exponential; 
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as shown in fig 1, a small change in library size can 
have either a negligible or a dramatic effect: on the 
probability of isolating an antibody with a given p[K] 
value , depending upon what point on the curve is given 
5 by the library size. 

WO 92/01047 and W092/20791 describe how the 
limitations of transformation efficiency (and therefore 
the upper limit on library size) can be overcome by use 
of other methods for introducing DNA into cells, such as 

10 infection. In one configuration, heavy and light chain 
genes are cloned separately on two different replicons, 
at least one of which is capable of being incorporated 
into a filamentous particle. Infectious particles 
carrying one chain are infected into cells harbouring 

15 the complementary chain; infection frequencies of >90% 

can be readily achieved. Heavy and light chains are then 
able to associate post-translationally in the periplasm 
and the combination displayed on the surface of the 
filamentous particle by virtue of one or both chains 

20 being connected to g3p. For example, a library of 10 7 
heavy chains is cloned as an unfused population in a 
phagemid, and 10 7 light chains are cloned as g3 fusions 
in fd-DOG-1. Both populations are then expanded by 
growth such that there are 10 7 of each heavy 

25 chain-containing cell and 10 7 copies of each light chain 
phage. By allowing the phage to infect the cells, 10 7 x 
10 7 = 10 14 unique combinations can be created, because 
there are 10 7 cells carrying the same heavy chain which 
can each be infected by 10 7 phage carrying different 

30 light chains. When this is repeated for each different 
heavy chain clone then one ends up with up to 10 14 
different heavy/light combinations in different cells. 
This strategy is outlined in fig 2, which shows the 
heavy chain cloned as g3 fusions on phage and the light 

35 chains expressed as soluble fragments from a phagemid. 
Clearly, the reverse combination, light chains on phage, 
heavy chain on phagemid, is also tenable. 
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In -the configuration shown in fig 2, fd-DOG 
f rescues ' "the phagemid so that: both phage and phagemid 
DNA is packaged into filamentous particles, and both 
types will have paired heavy and light chains on their 
5 surface, despite having the genetic information for only 
one of them. For a given antigen or epitope, the vast 
majority of the heavy and light chain pairings will be 
non- functional (ie. will not bind that antigen or 
epitope ) , so that selection on antigen will have the 

10 effect of vastly reducing the complexity of the heavy 
and light chain populations. After the first round of 
selection the clones are re-assorted, for example by 
infecting fresh host cells and selecting for both 
replicons. After several rounds of antigen selection and 

15 recovery of the two repliconsv the considerably reduced 
heavy and light chain populations can be cloned onto the 
same replicon and analysed by conventional means. 
Selection from the, say, 10 14 combinations produces a 
population of phages displaying a particular combination 

20 of H and L chains having the desired specificity. The 
phages selected however, will only contain DNA encloding 
one partner of the paired H and L chains. Selection for 
the two replicons may be as follows. Vectors of the H 
chain library may encode tetracycline resistance, with 

25 vectors of the D chain library encoding ampicillin 

resistance. The sample elute containing the population 
is divided into two portions. A first portion is grown 
on e.g. tetracycline plates to select those 
bacteriophage containing DNA encoding H chains which are 

30 involved in the desired antigen binding. A second 
portion is grown on e.g. ampicillin plates to select 
those bacteriophage containg phagemid DNA encoding L 
chains which are involved in the desired antigen 
binding. A set of colonies from individually isolated 

35 clones e.g. from the tetracycline plates are then used 
to infect specific colonies e.g. from the ampicillin 
plates. This results in bacteriophage expressing 
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specific combinations of H and L chains which can then 
be assayed for antigen binding. 

One technical problem with the use of separate 
replicons for VL and VH chains is so-called 
'interference 1 between filamentous phage origins of 
replication carried on different replicons as a result 
of competition for the same replication machinery. 

Procedures have been described which work on the 
principle of first reducing the complexity of a 
repertoire then recloning one or both chains of the 
reduced population ( WO92/20791 ) . The present invention 
provides a different approach. 
TERMINOLOGY 

Much of the terminology discussed in this section 
has been mentioned in the text where appropriate. 
Specific Binding Pair (sbp) 

This describes a pair of molecules (each being a 
member of a specific binding pair) which are naturally 
derived or synthetically produced. One of the pair of 
molecules, has an area on its surface, or a cavity which 
specifically binds to, and is therefore defined as 
complementary with a particular spatial and polar 
organisation of the other molecule, so that the pair 
have the property of binding specifically to each other. 
Examples of types of specific binding pairs are antigen- 
antibody, biotin-avidin, hormone -hormone receptor, 
receptor-ligand, enzyme- substrate, IgG-protein A. 
Multimeric Member 

This describes a first polypeptide which will 
associate with at least a second polypeptide, when the 
polypeptides are expressed in free form and/or on the 
surface of a substrate. The substrate may be provided 
by a bacteriophage. Where there are two associated 
polypeptides, the associated polypeptide complex is a 
dimer, where there are three, a trimer etc. The dimer, 
trimer, multimer etc or the multimeric member may 
comprise a member of a specific binding pair. 
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Example multimeric members are heavy domains 
based on an Immunoglobulin molecule, light: domains based 
on an immunoglobulin molecule, T-cell receptor subunits. 
Replicable Genetic Display Package (Rgdp) 
5 This describes a biological particle which has 

genetic information providing the particle with the 
ability to replicate. The particle can display on its 
surface at least part of a polypeptide. The polypeptide 
can be encoded by genetic information native to the 

10 particle and/or artificially placed into the particle or 
an ancestor of it. The displayed polypeptide may be any 
member of a specific binding pair eg. heavy or light 
chain domains based on an immunoglobulin molecule, an 
enzyme or a receptor etc. 

15 The particle may be a virus eg. a bacteriophage 

such as fd or M13. 
Package 

This describes a replicable genetic display 
package in which the particle is displaying a member of 
20 a specific binding pair at its surface. The package may 
be a bacteriophage which displays an antigen binding 
domain at its surface. This type of package has been 
called a phage antibody (pAb). 
Antibody 

25 This describes an immunoglobulin whether natural 

or partly or wholly synthetically produced. The term 
also covers any protein having a binding domain which is 
homologous to an immunoglobulin binding domain. These 
proteins can be derived from natural sources, or partly 

30 or wholly synthetically produced. 

Example antibodies are the immunoglobulin 
isotypes and the Fab, F(ab 1 ) 2 , scFv, Fv, dAb, Fd 
fragments . 

Immunoglobulin Super family 
35 This describes a family of polypeptides, the 

members of which have at least one domain with a 
structure related to that of the variable or constant 



WO 93/19172 



PCT/GB93/00605 



15 

domain of immunoglobulin molecules. The domain contains 
two S-sheets and usually a conserved disulphide bond 
(see A. F. Williams and A.N. Barclay 1988 Ann. Rev 
Immunol . 5. 
5 381-405). 

Example members of an immunoglobulin superfamily 
are CD4, platelet derived growth factor receptor 
(PDGFR), intercellular adhesion molecule. (ICAM). 
Except where the context otherwise dictates, reference 
10 to immunoglobulins and immunoglobulin homologs in this 
application includes members of the immunoglobulin 
superfamily and homologs thereof. 
Homoloas 

This term indicates polypeptides having the same 
15 or conserved residues at a corresponding position in 
their primary, secondary or tertiary structure. The 
term also extends to two or more nucleotide sequences 
encoding the homologous polypeptides . 

Example homologous peptides are the 
20 immunoglobulin isotypes. 
Functional 

In relation to a sbp member displayed on the 
surface of a rgdp, means that the sbp member is 
presented in a folded form in which its specific binding 

25 domain for its complementary sbp member is the same or 
closely analogous to its native configuration, whereby 
it exhibits similar specificity with respect to the 
complementary sbp member. In this respect, it differs 
from the peptides of Smith et al, supra, which do not 

30 have a definite folded configuration and can assume a 
variety of configurations determined by the 
complementary members with which they may be contacted. 
Genetically diverse population 

in connection with sbp members or polypeptide 

35 components thereof, this is referring not only to 

diversity that can exist in the natural population of 
cells or organisms, but also diversity that can be 
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created by artificial mutation in vitro or in vivo . 

Mutation in vitro may for example,, involve random 
mutagenesis using oligonucleotides having random 
mutations of the sequence desired to be varied. In vivo 
5 mutagenesis may for example, use mutator strains of host 
microorganisms to harbour the DNA (see Example 38 of WO 
92/01047). The word "population" itself may be used to 
denote a plurality of e.g. polypeptide chains, which are 
not genetically diverse i.e. they are all the same. 
10 Domain 

A domain is a part of a protein that is folded 
within itself and independently of other parts of the 
same protein and independently of a complementary 
binding member. 

15 Folded Unit 

This is a specific combination of an a-helix 
and/or B-strand and/or S-turn structure. Domains and 
folded units contain structures that bring together 
amino acids that are not adjacent in the primary 

20 structure. 
Free Form 

This describes the state of a polypeptide which 
is not displayed by a replicable genetic display 
package. 
25 Conditionally Defective 

This describes a gene which does not express a 
particular polypeptide under one set of conditions , but 
expresses it under another set of conditions. An 
example f is a gene containing an amber mutation 
30 expressed in non- suppressing or suppressing hosts 
respectively . 

Alternatively , a gene may express a protein which 
is defective under one set of conditions , but not under 
another set. An example is a gene with a temperature 
35 sensitive mutation. 

Suppressible Trans la'tional Stop Codon 

This describes a codon which allows the • 
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translation of nucleotide sequences downstream of the 
codon under one set of conditions, but under another set 
of conditions translation ends at the codon. Example of 
suppressible translational stop codons are the amber, 
5 ochre and opal codons. 
Mutator Strain 

This is a host cell which has a genetic defect 
which causes DNA replicated within it to be mutated with 
respect to its parent DNA. Example mutator strains are 
10 NR9046mutD5 and NR9046 mut Tl (see Example 38). 
Helper Phage 

This is a phage which is used to infect cells 
containing a defective phage genome and which functions 
to complement the defect. The defective phage genome 
15 can be a phagemid or a phage with some function encoding 
gene sequences removed. Examples of helper phages are 
M13K07, M13K07 gene III no. 3; and phage displaying or 
encoding a binding molecule fused to a capsid protein. 
Vector 

20 This is a DNA molecule, capable of replication in 

a host organism, into which a gene is inserted to 

construct a recombinant DNA molecule. 

Phage Vector 

This is a vector derived by modification of a 
25 phage genome, containing an origin of replication for a 

bacteriophage, but not one for a plasmid. 

Phagemid Vector 

This is a vector derived by modification of a 

plasmid genome, containing an origin of replication for 
30 a bacteriophage as well as the plasmid origin of 

replication . 

Secreted 

This describes a rgdp or molecule that associates 
with the member of a sbp displayed on the rgdp, in which 
35 the sbp member and/or the molecule, have been folded and 
the package assembled externally to the cellular 
cytosol. 
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Repertoire of Rearranged Immunoglobulin Genes 

A collection of naturally occurring nucleotides 
eg DNA sequences which encoded expressed immunoglobulin 
genes in an animal. The sequences are generated by the 
5 in vivo rearrangement of eg V, D and J segments for H 
chains and eg the V and J segments for L chains. 
Alternatively the sequences may be generated from a cell 
line immunised in vitro and in which the rearrangement 
in response to immunisation occurs intracellular ly. The 
10 word "repertoire" is used to indicate genetic diversity. 
Library 

A collection of nucleotide eg DNA, sequences 
within clones; or a genetically diverse collection of 
polypeptides, or specific binding pair members, or 
15 polypeptides or sbp members displayed on rgdps capable 
of selection or screening to provide an individual 
polypeptide or sbp members or a mixed population of 
polypeptides or sbp members. 

Repertoire of Artificially Rearranged Immunoglobulin 
20 Genes 

A collection of nucleotide eg DNA, sequences 
derived wholly or partly from a source other than the 
rearranged immunoglobulin sequences from an animal. 
This may include for example, DNA sequences encoding VH 
25 domains by combining unrear ranged V segments with D and 
J segments and DNA sequences encoding VL domains by 
combining V and J segments. 

Part or all of the DNA sequences may be derived 
by oligonucleotide synthesis. 
30 Secretory Leader Peptide 

This is a sequence of amino acids joined to the 
N-terminal end of a polypeptide and which directs 
movement of the polypeptide out of the cytosol. 
Eluant 

35 This is a solution used to breakdown the linkage 

between two molecules. The linkage can be a non- 
covalent or covalent bond( s ) . The two molecules can be 
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members of a sbp. 
Derivative 

This is a substance which derived from a 
polypeptide which is encoded by the DNA within a 
5 selected rgdp. The derivative polypeptide may differ 
from the encoded polypeptide by the addition, deletion, 
substitution or insertion of amino acids, or by the 
linkage of other molecules to the encoded polypetide. 
These changes may be made at the nucleotide or protein 
10 level. For example the encoded polypeptide may be a Fab 
fragment which is then linked to an Fc tail from another 
source. Alternatively markers such as enzymes, 
flouresceins etc may be linked to eg Fab, scFv 
fragments . 

15 According to one aspect of the present invention 

there is provided a method for producing multimeric 
specific binding pair (sbp) members, which method 
comprises 

causing or allowing recombination between (a) 

20 first vectors comprising nucleic acid encoding a 

population of a fusion of a first polypeptide chain of a 
specific binding pair member and a component of a 
replicable genetic display package (rgdp) and (b) second 
vectors comprising nucleic acid encoding a population of 

25 a second polypeptide chain of a specific binding pair 
member, at least one of said populations being 
genetically diverse, the recombination resulting in 
recombinant vectors each of which comprises nucleic acid 
encoding a said polypeptide fusion and a said second 

30 polypeptide chain and capable of being packaged into 
rgdps using said rgdp component. 

One or other or both of the populations of first 
and second polypeptide chains may be genetically 
diverse. Where both are genetically diverse, the 

35 recombinant vectors will represent an enormously diverse 
repertoire of sbp members. Either or both of the 
populations may be genetically diverse but restricted 
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compared with, the full repertoire available, perhaps by 
virtue of a preceding selection or screening step. A 
library of nucleic acid encoding a restricted population 
of polypeptide chains may be the product of selection or 
5 screening using rgdp display. 

According to another aspect of the invention 
there is provided a method of producing multimeric 
specific binding pair (sbp) members, which method 
comprises: 

10 (i) expressing from a vector in recombinant host 

organism cells a population of a first polypeptide chain 
of a specific binding pair member fused to a component 
of a replicable genetic display package (rgdp) which 
thereby displays said polypeptide chains at the surface 

15 of rgdps, and combining said population with a 
population of a second polypeptide chain of said 
specific binding pair member by causing or allowing 
first and second polypeptide chains to come together to 
form a library of said multimeric specific binding pair 

20 members displayed by rgdps, said population of second 
polypeptide chains not being expressed from the same 
vector as said population of first polypeptide chains, 
at least one of said populations being genetically 
diverse and expressed from nucleic acid that is capable 

25 of being packaged using said rgdp component, whereby the 
genetic material of each said rgdp encodes a polypeptide 
chain of a said genetically diverse population; 

(ii) selecting or screening rgdps formed by said 
expressing to provide an individual sbp member or a 

30 mixed population of said sbp members associated in their 
respective rgdps with nucleic acid encoding a 
polypeptide chain thereof; 

(iii) obtaining nucleic acid from a selected or 
screened rgdp, the nucleic acid obtained being one of 

35 (a) nucleic acid encoding a first polypeptide chain, (b) 
nucleic acid encoding a second polypeptide chain, and 
( c ) a mixture of ( a ) and ( b ) ; 
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(iv) producing a recombinant vector by causing or 
allowing recombination between (a) a vector comprising 
nucleic acid obtained in step (iii) encoding a first 
polypeptide chain and a vector comprising nucleic acid 
5 encoding a second polypeptide chain, or (b) a vector 
comprising nucleic acid encoding a first polypeptide 
chain and a vector comprising nucleic acid obtained in 
step (iii) encoding a second polypeptide chain. 
The recombination may take place 

10 intracellularly or in vitro , although it is preferable 
that it takes place in recombinant host cells. This is 
discussed elsewhere, but briefly this may involve 
introducing a library of vectors including nucleic acid 
encoding first (or second) polypeptide chain components 

15 of sbp member into host cells harbouring a library of 
vectors comprising nucleic acid encoding second (or 
first) polypeptide chain components of sbp members. 

Following the recombination the polypeptide 
fusions (first polypeptide chains fused to a rgdp 

20 component) and the second polypeptide chains may be 
expressed, producing rgdps which display at their 
surface said first and second polypeptide chains and 
which each comprise nucleic acid encoding a said first 
polypeptide chain and a said second polypeptide chain , 

25 by virtue of the packaging of the recombinant vectors 
into rgdps. This expression may therefore produce an 
extremely diverse library of sbp members displayed on 
rgdp. In one embodiment, the rgdps displaying sbp 
member are pAbs ( ie phage displaying antibodies or 

30 antibody fragments or derivatives ) , and those which bind 
antigen of interest may be selected using their binding 
capability. Since each pAb contains within it nucleic 
acid encoding both polypeptide chains of the antibody 
displayed on its surface, pAbs selected by binding to an 

35 antigen of interest will provide nucleic acid encoding 
an antibody which binds that antigen. The nucleic acid 
may be isolated from the selected pAbs and used in 
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subsequent obtention of desired antibodies, after any 
amplification and cloning required in a given case. 

The recombination may be promoted by inclusion in 
the vectors of sequences at which site- specific 
5 recombination will occur. This enables accurate design 
of the resultant recombinant vectors. For instance, a 
sequence at which site-specific recombination will occur 
may be position in the nucleic acid which encodes a 
polypeptide linker which joins the two domains of a 

10 single chain sbp member. The single chain sbp member 
may consist of an immunoglobulin VH domain linked to an 
immunoglobulin VL domain. VH and VL domains may 
associate to form an antigen binding site. The 
resultant recombinant vector may then comprise nucleic 

15 acid encoding a single chain Fv derivative of an 

immunoglobulin resulting from recombination between 
first and second vectors. (Note: a single chain sbp 
member, such as a scFv fragment or derivative of an 
antibody, may be considered to be multimeric (dimeric) 

20 because it consists of two polypeptide chain domains, 
such as VL and VH of an antibody. ) 

The sequences at which site-specific 
recombination will occur may be loxP sequences 
obtainable from coliphage PI, with site-specific 

25 recombination catalysed by Cre-recombinase, also 
obtainable from coliphage PI. The site-specific 
recombination sequences used may be derived from a loxP 
sequence obtainable from coliphage PI. 

The Cre-recombinase used may be expressible under 

30 the control of a regulatable promoter. 

In order to increase the efficiency of the 
method, increasing the proportion of productive 
recombination leading to the resultant recombinant 
vectors desired, each vector may include two site- 

35 specific recombination sequences each of which is 

different from the other. The sequences should then be 
such that recombination will take place between like 
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sequences on different vectors but not between the 
different sequences on the same vector. 

Each of the first vectors and each of the second 
vectors may include a first site- specific recombination 
5 sequence and a second site-specific recombination 
sequence different from the first, site-specific 
recombination taking place between first site-specific 
recombination sequences on different vectors and between 
second site-specific recombination sequences on 
10 different vectors but not between a first site-specific 
recombination sequence and a second site- specific 
recombination sequence on the same vector. 

The first site- specific recombination sequence 
may be loxP obtainable from coliphage PI and the second 
15 site-specific recombination sequence a mutant loxP 

sequence, or vice versa. Potentially, both the first 
and second site- specific recombination sequences may be 
mutants, as long as the first sequence will not 
recombine with the second sequence but first sequences 
20 will recombine with each other and second sequences 
will recombine with each other. 

A suitable mutant loxP sequence is loxP 511. 
The first vectors may be phages or phagemids and 
the second vectors plasmids, or the first vectors may be 
25 plasmids and the second vectors phages or phagemids. 
In one embodiment, the recombination is 
intracellular and takes place in a bacterial host which 
replicates the recombinant vector preferentially over 
the first vectors and the second vectors. This may be 
30 used to enrich selection of successful recombination 

events. The intracellular recombination may take place 
in a bacterial host which replicates plasmids 
preferentially over phages or phagemids, or which 
, replicates phages or phagemids preferentially over 
35 plasmids. For instance, the bacterial host may be a 
PolA strain of E.coli or of another gram-negative 
bacterium. PolA cells are unable to support replication 
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of plasmids r but: can support replication of filamentous 
phage and phagemids (plasmids containing filamentous 
phage intergenic regions). So, for instance, if the 
first vectors are plasmids containing a first marker 
5 gene, and the second vectors are phage or phagemids 
containing a second marker gene, selection for both 
markers will yield recombinant vectors which are the 
product of a successful recombination event, since 
recombination transferring the first marker from plasmid 

10 must take place in order for that marker to be 
replicated and expressed. 

Nucleic acid from one or more rgdp's may be taken 
and used in a further method to obtain an individual sbp 
member or a mixed population of sbp members, or 

15 polypeptide chain components thereof, or encoding 
nucleic acid therefor. 

The present invention also provides a kit for use 
in carrying out methods provided, having: 

(i) a first vector having a restriction site 
20 for insertion of nucleic acid encoding or a polypeptide 

component of an sbp member, said restriction site being 
in the 5 r end region of the mature coding sequence of a 
phage capsid protein, with a secretory leader sequence 
upstream of said site which directs a fusion of the 
25 capsid protein and sbp polypeptide to the periplasmic 
space of a bacterial host; and 

(ii) a second vector having a restriction site 
for insertion of nucleic acid encoding a second said 
polypeptide chain, 

30 at least one of the vectors having an origin of 

replication for single-stranded bacteriophage, the 
vectors having sequences at which site -specific 
recombination will occur. 

The kit may contain ancillary components needed 
35 for working the method. 

Also provided by the present invention are 
recombinant host cells harbouring a library of first 
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vectors each comprising nucleic acid encoding a first: 
polypeptide chain of a sbp member fused to a component 
of a secretable replicable genetic display package 
(rgdp) and second vectors each comprising nucleic acid 
5 encoding a second polypeptide chain of a sbp member, the 
first vectors or the second vectors or both being 
capable of being packaged into rgdps using the rgdp 
component, and the vectors having sequences at which 
site-specific recombination will occur. 

10 According to another aspect of the present 

invention there is providedn a population of rgdps each 
displaying at its surface a sbp member and each 
containing nucleic acid which encodes a first and a 
second polypeptide chain of the sbp member displayed at 

15 its surface and which includes a site-specific 
recombination sequence . 

According to another aspect of the invention 
there is provided a population of rgdps each displaying 
at its surface a sbp member and each containing nucleic 

20 acid which comprises a combination of (i) nucleic acid 
encoding a first polypeptide chain of a sbp member and 
(ii) nucleic acid encoding a second poypeptide chain of 
a sbp member, the population containing 10 10 or more 
combinations of ( i ) and ( ii ) . Such a population exceeds 

25 in size the maximum which is achievable using available 
techniques. The present invention enables production of 
enormously diverse libraries or populations of rgdps 
displaying sbp members. The nucleic acid encoding a 
first polypeptide chain of a sbp member may have, for 

30 instance, 10 7 different sequences throughout the 

population. Where the nucleic acid encoding a second 
polypeptide chain of a sbp member also has such a 
genetic diversity throughout the population, the number 
of different combinations of nucleic acid encoding first 

35 and second polypeptide chains is immense. 
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Embodiments of "the present invention will now be 
described in more detail by way of example only and not 
by way of limitation,, with reference to the figures. 

5 BRIEF DESCRIPTION OF THE FIGURES 

Fig. 1 shows plots of the probability of isolating 
an antibody with a given p[K] value against the size of 
a library. 

Fig. 2 outlines a strategy to clone heavy chain as 

10 g3 fusion on phage, light chain being expressed as 
soluble fragments from a phagemid. 

Fig. 3 (i) and (ii) illustrates the use of sites 
specific recombination for construction of 
polycombinantorial libraries . 

15 Fig 4A shows replicons generated by Cre mediated 

recombination between the acceptor phage vector 
fdD0G-21ox (A) and the donor plasmid vector pUC19-21ox 
(B).^ A is based on f d-tet-DOGl , with Vic from the mouse 
anti-phOx antibody NQ10.12.5 linked to a human Ck 

20 constant domain, and VH from the mouse anti-TNFa 

antibody linked to a human Cml constant domain. B is 
based on pUC19 A with VH of NQ10.12.5 linked to the human 
Cgl constant domain. Within E. coli an equilibrium 
between the six vectors develops due to the reversible 

25 nature of recombination in the lox-Cre system. 

Ribosome-binding sites (small open circles ) r c-myc 
peptide tag (myc), phage fd gene III leader peptide 
sequence ( Lg3 ) r pelB leader peptide sequence ( LpelB ) r f d 
phage gene III (gill) and locations of oligonucleotides 

30 used for hybridisation and screening are indicated. 

Fig 4B shows the sequence across the wild- type loxP 
and mutant loxP 511 sites present in fdDOG-21ox (A) and 
pUC19-21ox (B). The inverted repeats in the loxP sites 
are boxed and the position of the point mutation in the 
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mutant loxP 511 site is indicated (#), as are the 
ribosome-binding sites (r.b.s.). Note that the wild-type 
loxP sites are in frame to ensure that the heavy chains 
immediately upstream can be fused to gene 111 for 
display on phage. 

Fig. 5 shows schematically selection techniques 
which utilise the unique properties of pAbs; 5(i) shows 
a binding/elution system; and 5(ii) shows a competition 
system (p=pAb; ag=antigen to which binding by pAb is 
required; c=competitor population e.g. antibody, pAb, 
ligands; s=substrate (e.g. plastic beads etc); 
d=detection system. 

Disclosed here are methods useful for preparing 
extremely diverse libraries of specific binding pair 
members, such as antibody heavy and light chains. Heavy 
and light chains cloned on separate replicons may be 
introduced into host cells. The heavy and light chain 
genes are recombined onto the same replicon such that 
the final number of combinations created is the number 
of heavy chains multiplied by the number of light 
chains. Recombination can occur in vivo or in vitro . 
Preferably, the recipient replicon is capable of being 
incorporated into an rgdp such that functional 
combinations of heavy and light chain genes can be 
selected. Such a format is particularly advantageous for 
construction of extremely diverse libraries of antibody 
heavy and light chains, for example, from unimmunised 
donors, immunised donors or a repertoire of an 
artificially rearranged immunoglobulin gene or genes, 
and is also convenient for chain-shuf fling, mutagenesis, 
humanising and CDR 'imprinting'. 

These methods can also be applied to other proteins 
in which two or more subunits assemble to create a 
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functional oligomer. 

The genes for both subunits present on two separate 
replicons can be brought together onto the same rgdp 
such that favourable combinations of subunit genes may 
5 be isolated directly without recourse to extensive 

recloning. This may be achieved by recombination between 
the replicons once they have been introduced into the 
same cell. In a preferred configuration,, recombination 
events are effected such that the genes for one of the 

10 chains is recombined onto a recipient replicon which 
contains the gene for a partner chain. Preferably , the 
recipient replicon is capable of being packaged into an 
rgdp. Most preferably, the genes encoding one or more of 
the subunit s is fused to a capsid gene such as gill in 

15 order that the functional multimer can be displayed on 
the surface of the rgdp. 

A variety of recombination systems are known, and 
many .of these could be harnessed in such a way as to 
effect recombination between replicons. Example 

20 recombination systems include general recombination, 
transposition and site- specific recombination. 

General recombination is a process whereby genetic 
exchange occurs between DNA segments that share some 
homology, and is also known as 'homologous 

25 recombination 1 . It is the principal mechanism by which 
genetic material is transferred between chromosones, and 
in E.coli the process is catalysed by the rec BCD enzyme 
(In "Escherichia coli and Salmonella typhimurium. 
Cellular and Molecular Biology. "( 1987 ) . ppl034-1043. 

30 Neidhart, F.C. Editor in Chief. American Society for 

Microbiology) . A general recombination mechanism could 
be used to transfer genes from one replicon to the other 
if, for example r the rgdp genome has a gene for one of 
the chains and a 1 dummy T partner chain gene . such that 
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recombination would have to occur to replace the dummy 
gene on the rgdp replicon with the functional gene on 
the second replicon in order to produce a functional 
pairing. 

5 Transposition could also be used to effect transfer 

of genetic information from one replicon to another ( In 
"Escherichia coli and Salmonella typhimurium. Cellular 
and Molecular Biology ."( 1987 ) • ppl061-1070. Neidhart, 
F.C. Editor in Chief. American Society for 

10 Microbiology). Transposons such as Tn 3 and Tn 10 are 
DNA segments that have also been called 'jumping genes' 
and 'selfish DNA 1 and are found on plasmids and in the 
E.coli chromosome. Transposon structure is variable, but 
usually comprises recombinase genes flanked by repeated 

15 DNA sequences; the recombinase(s) together with host 

factors catalyse insertion of the transposon into sites 
on the chromosone, by a mechanism which usually results 
in a duplication of site at which the transposon has 
inserted. Insertion by some transposons can be highly 

20 site-specific wheras others insert essentially at 

random. For the purpose of transferring genes from one 
replicon to another, the donor gene could be 
incorporated within a highly site specific transposon 
such as Tn 7. The recipient plasmid would be engineered 

25 to contain the target DNA sequence. 

One of the most fully understood site- specific 
recombination systems is that used in integration and 
excision of bacteriophage lambda (In "Escherichia coli 
and Salmonella typhimurium. Cellular and Molecular 

30 Biology. "(1987) . ppl054-1060. Neidhart, F.C. Editor in 
Chief. American Society for Microbiology). This 
bacteriophage can follow two developmental pathways once 
inside the cell: lysis or lysogeny. The lysogenic 
pathway involves integration of the lambda genome into 
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■the chromosome of the Infected bacterium; integration is 
the result of a site-specific recombination between a 
ca. 240bp sequence in the bacteriophage called att P and 
a 25bp site in the bacterial chromosone called att B. 
5 The integration event is catalysed by a host encoded 
factor called IHF and a phage encoded enzyme called Xnt 
recombinase, which recognises a 15bp region common to 
the two att sites. The integrated DNA is flanked by 
sequences derived from att B and att P, and these are 

10 called att L and att R. The integration event is 

reversible and is catalysed by Int, IHF and a second 
bacteriophage encoded enzyme,, Xls. It is envisaged that 
this system could be used for sequence transfer between 
replicons within E.coli. For example , the donor gene 

15 could be flanked by att L and att R sites such that when 
Int and Xis proteins are provided in the host cell, 
recombination between att L and att R sites would 
create a circular DNA segment containing the donor gene 
and a recreated att B site. This circular segment could 

20 then recombine with an att P site engineered into the 
recipient plasmid. 

An alternative site specific recombination system 
is the lox P/Cre recombinase system of coliphage PI 
(Hoess, R.H. and Abremski, K. (1990) The Cre-lox 

25 recombination system. In 'Nucleic acids and Molecular 
Biology.' Eckstein, F. and Lilley, D.M.J, eds. Vol 4, 
pp99-109, Springer- Ver lag, Berlin, Heidelberg). 
Cre— recombinase catalyses a highly specific 
recombination event at sequences called lox. lox P, the 

30 recombination site in phage PI consists of two 13bp 
inverted repeats separated by an 8bp non- symmetrical 
core ( fig 3 ) . For the work descended in this 
application, the lox P/Cre system was chosen of the 
alternatives available because the recombination is 
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highly sequence- specif ic, very efficient and occurs at a 
short target site that is readily incorporated into 
cloning vectors. 

In the example outlined configuration in fig 3 
5 soluble light chain is cloned onto a phagemid containing 
a single lox P site. The heavy chains are cloned onto a 
plasmid as g3 fusions. Alongside the g3 fusion is the 
gene for a selectable marker, and the 
heavychain/g3 /marker sequence flanked by two lox P 

10 sites. This plasmid also contains the Cre recombinase on 
a regulatable promoter and has an origin of 
double- stranded replication that is compatible with that 
on the phagemid in addition to that on the helper phage 
e.g. pl5A, RSF 1010 and col El origins will co-exist in 

15 the same cell. The phagemids are then infected into 
cells containing the donor plasmid and the Cre 
recombinase promoter induced, so that recombination 
between the lox P sites occurs inside infected cells. 
Some of these recombination events will lead to the 

20 heavychain/g3/marker sequences transferring as a block 
onto the phagemid at its single lox P site. Phagemids 
are then rescued with a helper phage such as M13K07 (see 
W092/01047 )and the resulting phagemid particles either 
directly selected on antigen or infected into fresh host 

25 cells and grown with selection for the presence of both 
markers; one from the phagemid itself and the other from 
the heavychain/g3 /marker block. 

The use of site-specific recombination to bring 
genes onto the same replicon may be extended to creation 

30 of a continuous coding sequence on the same replicon, 
for example to construct single-chain Fv molecules. 
There is a single open reading frame in the loxP 
sequence that could be incorporated into an scFv linker 
which would then be a substrate for Cre-catalysed 
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site-specif ic recombination* Placement of such modified 
scFv linker sequences at: one or both ends of -the genes 
to be fused can then result: in creation of continuous 
open reading frames in vivo or in vitro when Cre 
5 recombinase is provided. 

As with other site-specific recombination systems, 
Cre-catalysed recombination is reversible such that 
productive recombinants form only a fraction of the 
recombinants. Selection of productive rearrangements may 

10 be facilitated by use of a polA strain of bacteria, 
preferably E.coli or other gram negative bacterium. 
These cells are deficient in DNA polymerase I and are 
unable to support replication of plasmids (Johnston, S. 
and Ray, D.S. 1984, supra.). However, they are able to 

15 support replication of filamentous phage and plasmids 

containing filamentous phage intergenic regions. If Cre- 
catalysed recombination is performed in polA bacteria, 
by selecting for the presence of both selectable markers 
in the same pol A cell successful recombination events 

20 are enriched, since recombination must take place for 
the second marker gene to be replicated and expressed. 
The resulting cells then contain the complete repertoire 
and can be propagated as cells and infected with helper 
phage to produce phagemids containing the genes for both 

25 chains and expressing them on their surface. 

Another way of enriching for productive 
recombination events is to employ mutant loxP sites. 
Several mutants of the loxP sequence are known, and 
these are compromised with respect to their ability to 

30 recombine with each other and the wild- type loxP 

sequence ( Hoes s , R.H. , Wier zbicki , A . and Abr emski , K . 
(1986) Nucl. Acids Res. 14, 2287-2300). For example, 
loxP 511 has a G->A point mutation in the central 8bp 
segement, with the result that it will only recombine 



WO 93/19172 



PCT/GB93/00605 



33 

with other loxP 511 sites, but not the wild-type loxP 
sequence (Hoess, R.H., Wierzbicki, A. and Abremski, K. 
(1986) et supra.)- Placement of wild- type and mutant 
loxP sequence combinations can direct which 
5 recombination events are possible: their use is 

described in example 1. Other mutant loxP sites are 
known but their abilities to recombine with each other 
and the wild-type loxP sequence have not been 
extensively characterised, presumably loxP 511 is not 

10 unique. Provision of different mutant loxP sites in the 
vectors would permit even greater control over the 
occurance of recombination events perhaps leading to 
more complex, controllable and efficient recombination 
strategies being possible. 

15 The presence of target DNA sequences for 

site-specific recombination in the vectors has utility 
for subsequent manipulation of the genes. Naturally 
occurring or artificially introduced loxP sequences in 
the genomes of prokaryotic and eukaryotic organisms can 

20 be used as target sites for insertion of genes* 

Moreover, since Cre-catalysed recombination occurs 
readily in vitro, rapid and efficient transfer of genes 
in vitro, for example between different vectors, is also 
contemplated (Boyd, A.C. (1993) Nuc. Acids Res. 21, 

25 817-821) 

It will be apparent that the concept of using two 
or more replicons to generate diversity is not confined 
to display of mul timers on the surface of filamentous 
bacteriophages. For example, bacteria could be used as 
30 the replicable genetic display package. For example, 

Fuchs et al. have shown that functional antibody can be 
displayed on the surface of E.coli by fusion to 
peptidoglycan-associated lipoprotein ( Fuchs , P . , 
Breitling, F. , Dubel, S., Seehaus, T and Little, M. 
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(1991) Targetting of recombinant: antibodies to -the 
surface of Escherichia coli: fusion -to a peptidoglycan 
associated lipoprotein. Biotechnology 9, 1369-1373.). 
Klauser et al. describe transport of a heterologous 
5 protein to the surface of E.coli by fusion to Neisseria 
IgA protease (Klauser, T., Pohler, J. and Meyer, T. F. 
(1990) Extracellular transport of cholera toxin B 
subunit using Neisseria IgA protease B domain: 
conformation- dependent outer membrane translocation. 

10 EMBO 9, 1991-1999). Other surface proteins such as pili, 
ompA or the surface-exposed lipoprotein Tra T could also 
be used, and gram positive organisms such as 
lactobacilli and streptococci employed. Cloning and 
expression in Eukaryotic organisms is also contemplated. 

15 Alternative cloning strategies are possible when 

cells are used in place of phage. For example, replicons 
can be introduced into the cells by conjugation, in 
addition to transformation and infection. Moreover, one 
or more genes can be recombined or transposed into the 

20 chromosome reducing the limitation of having to use 
compatible replicons . 

The polycombinatorial concept is also particularly 
advantageous for mutagenesis experiments by allowing far 
greater numbers of mutant progeny to be produced. For 

25 example, if the genes encoding a mul timer ic peptide or 
polypeptide are mutated at a total of 10 amino acid 
positions, to incorporate any amino acid at these 
positions, then the total number of combinations is 
20 10 => 1.024 10 13 . This figure is way beyond the reach of 

30 standard cloning formats, but can be achieved using the 
approaches described here. 

The methods described here are applicable to 
multimeric proteins other than antibodies, such a T cell 
receptors, CD3 and insulin receptor. Libraries of 
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proteins having more -than two different and diverse 
subunits can be created by, for example, more than one 
cycle of infection. Cells containing one of the subunits 
are infected with phage containing the second subunit 
5 and the resulting population infected a second time with 
a compatible phage carrying the third subunit. 

In some cases, it is advantageous to express all 
components of the multimer as g3 fusions. This will have 
the benefit stabilising weak interactions between 

10 seperate chains, e.g. VHg3 and VLg3 to create phage or 
phagemid particles with both VH and VL fused to g3 on 
the same particle, or stabilising polypeptides which 
interact weakly, or polypeptides which only associate in 
the presence of ligand. 

15 The numbers of combinations possible with the 

polycombinatorial approach is limited only by the number 
of clones present in each of the repertoires, and, in 
the specific instance of using phage supplying one chain 
to infect cells containing the other, by the numbers of 

20 phage and cells that can be produced. The use of more 
sophisticated methods, for example fermentation 
technology, will allow even greater numbers of 
combinations to be accessed. 

25 The nucleic acid encoding first and second 

polypeptide components of antibodies may be derived from 
the repertoire of an immunised or unimmunised animal or 
human, or from an artificially rearranged immunoglobulin 
gene or genes. Artificial rearrangement of 

30 immunoglobulin genes may involve joining of germ- line V 
segments in vitro to J segments and, in the case of VH 
domains, D segments. Any of the V, D and J segments may 
be synthetic. The joining may use a PCR-based process 
which may use primers which have a region of random 
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sequence to introduce sequence diversity into the 
product, artificially rearranged immunoglobulin genes. 

Filamentous F- specific bacteriophages are suitable 
5 examples of the type of phage which provide a vehicle 
for the display of binding molecules e.g. antibodies and 
antibody fragments and derivatives thereof, on their 
surface and facilitate subsequent selection and 
manipulation . 

10 The F-specific phages (e.g. fl, fd and M13) have 

evolved a method of propagation which does not kill the 
host cell and they are used commonly as vehicles for 
recombinant DNA (Kornberg, A. , DNA Replication, W.H. 
Freeman and Co., San Francisco, 1980). Gene III of 

15 phage fd is attractive for the insertion of biologically 
active foreign sequences. There are however, other 
candidate sites including for example gene VIII and gene 
VI. - 

The protein encoded by gene III has several domains 
20 (Pratt, D., et al., 1969 Virology 39 : 42-53. , Grant, 
R.A., et al . , 1981, J. Biol. Chem. 256 : 539-546 and 
Armstrong, J., et al. , FEBS Lett. 135 : 167-172 1981). 

The gene coding sequences for biologically active 
antibody fragments have been inserted into the gene III 
25 region of fd to express a large fusion protein. An 
initial vector used was fd-tet (Zacher, A.N., et al., 
1980, Gene £, 127-140) a tetracycline resistant version 
of fd bacteriophage that can be propagated as a plasmid 
that confers tetracycline resistance to the infected 
30 E . coll host. The applicants chose to insert after the 
signal sequence of the fd gene III protein for several 
reasons. In particular, the applicants chose to insert 
after amino acid 1 of the mature protein to retain the 
context for the signal peptidase cleavage. To retain 
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the structure and function of gene III itself, the 
majority of the original amino acids are synthesized 
after the inserted immunoglobulin sequences. The 
inserted immunoglobulin sequences were designed to 
5 include residues from the switch region that links VH-VL 
to CH1-CL (Lesk, A., and Chothia, C. ,. Nature 335, 188- 
190, 1988). 

By manipulating gene III of bacteriophage fd, one 
can construct a bacteriophage that displays on its 

10 surface large biologically functional antibody, enzyme, 
and receptor molecules whilst remaining intact and 
infectious. Furthermore, the phages bearing antibodies 
of desired specificity, can be selected from a 
background of phages not showing this specificity. 

15 The sequences coding for a population of antibody 

molecules and for insertion into the vector to give 
expression of antibody binding functions on the phage 
surface can be derived from a variety of sources. For 
example, immunised or non- immunised rodents or humans, 

20 and from organs such as spleen and peripheral blood 
lymphocytes. The coding sequences are derived from 
these sources by techniques familiar to those skilled in 
the art (Orlandi, R. , et al., 1989 supra; Larrick, J.W., 
et al., 1989 supra; Chiang, Y.L., et al., 1989 Bio 

25 Techniques 7, p. 360-366; Ward, E.S, et al., 1989 supra; 
Sastry, L., et al., 1989 supra.) 

In standard recombinant techniques for the 
production of antibodies, an expression vector 
containing sequences coding for the antibody polypeptide 

30 chains is used to transform e.g. E.coli. The antibody 
polypeptides are expressed and detected by use of 
standard screening systems . When the screen detects an 
antibody polypeptide of the desired specificity, one has 
to return to the particular transformed E.coli . 
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expressing the desired antibody polypeptide. 
Furthermore, the vector containing the coding sequence 
for the desired antibody polypeptide then has to be 
isolated for use from E.coli in further processing 
5 steps . 

In the present invention however, the desired 
antibody polypeptide when expressed, is already packaged 
with its gene coding sequence. This means that when the 
an antibody polypeptide of desired specificity is 

10 selected, there is no need to return to the original 

culture for isolation of that sequence. Furthermore, in 
previous methods in standard recombinant techniques, 
each clone expressing antibody needs to be screened 
individually. The present application provides for the 

15 selection of clones expressing antibodies with desired 
properties . 

Because a rgdp (eg a pAb) displays a member of a 
specific binding pair (eg. an antibody of monoclonal 
antigen-binding specificity) at the surface of a 

20 relatively simple replicable structure also containing 
the genetic information encoding the member, rgdps eg 
pAbs, that bind to the complementary member of the 
specific binding pair (eg antigen) can be recovered very 
efficiently by either eluting off the complementary 

25 member using for example diethylamine, high salt etc and 
infecting suitable bacteria, or by denaturing the 
structure, and specifically amplifying the sequences 
encoding the member using PCR. That is, there is no 
necessity to refer back to the original bacterial clone 

30 that gave rise to the pAb. 

SELECTION FORMATS AND AFFINITY MATURATION 



Individual rgdps eg pAbs expressing the desired 
specificity eg for an antigen, can be isolated from the 
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complex library using the conventional screening 
techniques (e.g. as described in Harlow, E. , and Lane, 
D., 1988, supra Gherardi, E et al. 1990. J. Immunol, 
meth. 126 p61-68). 
5 Other selection techniques, described and 

illustrated in WO 92/01047, are practicable only because 
of the unique properties of rgdps. The general outline 
of some screening procedures is illustrated in figure 5 
using pAbs as an example type of rgdp. 

10 The population/library of pAbs to be screened could 

be generated from immunised or other animals; or be 
created in vitro by mutagenising pre-existing phage 
antibodies (using techniques well-known in the art such 
as oligonucleotide directed mutagenesis (Sambrook, J., 

15 et al., 1989 Molecular Cloning a Laboratory Manual, Cold 
Spring Harbor Laboratory Press ) . This population can be 
screened in one or more of the formats described below 
with reference to figure 5, to derive those individual 
pAbs whose antigen binding properties are different from 

20 sample c. 

Binding Elution 

Figure 5(i) shows antigen (ag) bound to a solid 
surface (s) the solid surface (s) may be provided by a 
petri dish, chromatography beads, magnetic beads and the 

25 like. The population/library of pAbs is then passed 
over the ag, and those individuals p that bind are 
retained after washing, and optionally detected with 
detection system d. A detection system based upon anti- 
fd antisera is illustrated in more detail in example 4 

30 of WO 92/01047. If samples of bound population p are 
removed under increasingly stringent conditions, the 
binding affinity represented in each sample will 
increase. Conditions of increased stringency can be 
obtained, for example, by increasing the time of soaking 
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or changing the pH of the soak solution , etc. 
Competition 

Referring to figure 5(11) antigen ag can be bound 
to a solid support s and bound to saturation by the 
5 original binding molecule c. If a population of mutant 
pAb (or a set of unrelated pAbs) is offered to the 
complex, only those that have higher affinity for 
antigen ag than c will bind. In most examples, only a 
minority of population c will be displaced by 

10 individuals from population p. If c is a traditional 
antibody molecule, all bound material can be recovered 
and bound p recovered by infecting suitable bacteria 
and/or by use of standard techniques such as PCR. 

An advantageous application is where ag is used as 

15 a receptor and c the corresponding ligand. The 
recovered bound population p is then related 
structurally to the receptor binding site/and or ligand. 
This type of specificity is known to be very useful in 
the pharmaceutical industry. 

20 Another advantageous application is where ag is an 

antibody and c its antigen. The recovered bound 
population p is then an anti-idiotype antibody which 
have numerous uses in research and the diagnostic and 
pharmaceutical industries . 

25 At present it is difficult to select directly for 

anti-idiotype antibodies. pAbs would give the ability 
to do this directly by binding pAb libraries (eg a naive 
library) to B cells (which express antibodies on their 
surface) and isolating those phage that bound well. 

30 In some instances it may prove advantageous to pre- 

select population p. For example, in the anti-idiotype 
example above, p can be absorbed against a related 
antibody that does not bind the antigen. 

However, if c is a pAb, then either or both c and p 
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can advantageously be marked in some way "to both 
distinguish and select for bound p over bound c. This 
marking can be physical, for example, by pre-labelling p 
with biotin; or more advantageously, genetic. For 
5 example, c can be marked with an EcoB restriction site, 
whilst p can be marked with an EcoK restriction site 
(see Carter, P. et al., 1985, Nucl. Acids Res. 13, 4431- 
4443). When bound p+c are eluted from the antigen and 
used to infect suitable bacteria, there is restriction 

10 (and thus no growth) of population c (i.e. EcoB 

restricting bacteria in this example ) . Any phage that 
grew, would be greatly enriched for those individuals 
from p with higher binding affinities. Alternatively, 
the genetic marking can be achieved by marking p with 

15 new sequences, which can be used to specifically amplify 
p from the mixture using PCR. 

Since the bound pAbs can be amplified using for 
example PCR or bacterial infection, it is also possible 
to rescue the desired specificity even when insufficient 

20 individuals are bound to allow detection via 
conventional techniques . 

The preferred method for selection of a phage 
displaying a protein molecule with a desired specificity 
or affinity will often be elution from an affinity 

25 matrix with a ligand (eg example 21 of WO 92/01047). 

Elution with increasing concentrations of ligand should 
elute phage displaying binding molecules of increasing 
affinity. However, when eg a pAb binds to its antigen 
with high affinity or avidity (or another protein to its 

30 binding partner) it may not be possible to elute the pAb 
from an affinity matrix with molecule related to the 
antigen. Alternatively, there may be no suitable 
specific eluting molecule that can be prepared in 
sufficiently high concentration. In these cases it is 
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necessary "to use an elution method which Is not: specific 
to eg the antigen- antibody complex. Some of the non- 
specific elution methods generally used reduce phage 
viability for instance, phage viability is reduced with 
5 time at pH12 (Rossomando, E.F. and Zinder N.D. J. 

Mol.Biol. 36 387-399 1968). There may be interactions 
between eg antibodies and affinity matrices which cannot 
be disrupted without completely removing phage 
infectivity. In these cases a method is required to 

10 elute phage which does not rely on disruption of eg the 
antibody - antigen interaction. A method was therefore 
devised which allows elution of bound pAbs under mild 
conditions (reduction of a di thiol group with 
dithiothreitol ) which do not disrupt phage structure 

15 (example 47 of WO 92/01047). 

This elution procedure is just one example of an 
elution procedure under mild conditions. A particularly 
advantageous method would be to introduce a nucleotide 
sequence encoding amino acids constituting a recognition 

20 site for cleavage by a highly specific protease between 
the foreign gene inserted, in this instance a gene for 
an antibody fragment, and the sequence of the remainder 
of gene III. Examples of such highly specific proteases 
are Factor X and thrombin. After binding of the phage 

25 to an affinity matrix and elution to remove non-specific 
binding phage and weak binding phage, the strongly bound 
phage would be removed by washing the column with 
protease under conditions suitable for digestion at the 
cleavage site. This would cleave the antibody fragment 

30 from the phage particle eluting the phage. These phage 
would be expected to be infective, since the only 
protease site should be the one specifically introduced. 
Strongly binding phage could then be recovered by 
infecting eg. E.coli TGI cells. 
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An alternative procedure to the above is to take 
the affinity matrix which has retained the strongly 
bound pAb and extract the DNA, for example by boiling in 
SDS solution. Extracted DNA can then be used to 
5 directly transform E.coli host cells or alternatively 
the antibody encoding sequences can be amplified, for 
example using PCR with suitable primers such as those 
disclosed herein, and then inserted into a vector for 
expression as a soluble antibody for further study or a 
10 pAb for further rounds of selection. 

Another preferred method for selection according to 
affinity would be by binding to an affinity matrix 
containing low amounts of ligand. 

If one wishes to select from a population of phages 
15 displaying a protein molecule with a high affinity for 
its ligand, a preferred strategy is to bind a population 
of phage to an affinity matrix which contains a low 
amount of ligand. There is competition between phage, 
displaying .high affinity and low affinity proteins, for 
20 binding to the ligand on the matrix. Phage displaying 
high affinity protein is preferentially bound and low 
affinity protein is washed away. The high affinity 
protein is then recovered by elution with the ligand or 
by other procedures which elute the phage from the 
25 affinity matrix (example 35 of WO 92/01047 demonstrates 
this procedure). 

In summary then, for recovery of the packaged DNA 
from the affinity step, the package can be simply 
eluted, it can be eluted in the presence of a homologous 
30 sbp member which competes with said package for binding 
to a complementary sbp member; it could be removed by 
boiling, it could be removed by proteolytic cleavage of 
the protein; and other methods will be apparent to those 
skilled in the art eg. destroying the link between the 
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substrate and complementary sbp member "bo release said 
packaged DNA and sbp member. At: any rate, the objective 
is to obtain the DNA from the package so that it can be 
used directly or indirectly, to express the sbp member 
5 encoded thereby. 

The efficiency of this selection procedure for pAbs 
and the ability to create very large libraries means 
that the immunisation techniques developed to increase 
the proportion of screened cells producing antibodies of 

10 interest will not be an absolute requirement. The 
technique allows the rapid isolation of binding 
specificities eg antigen-binding specificities, 
including those that would be difficult or even 
unobtainable by conventional techniques, for example, 

15 catalytic or anti-idiotypic antibodies. Removal of the 
animal altogether is now possible, once a complete 
library of the immune repertoire has been constructed. 

The structure of the pAb molecule can be used in a 
number of other applications, some examples of which 

20 are: 

Signal Amolif ication 

Acting as a molecular entity in itself, rgdps eg 
pAbs combine the ability to bind a specific molecule eg 
antigen with amplification, if the major coat protein is 

25 used to attach another moiety. This moiety can be 

attached via immunological, chemical, or any other means 
and can be used, for example, to label the complex with 
detection reagents or cytotoxic molecules for use in 
vivo or in vitro . 

30 Physical Detection 

The size of the rgdps eg pAbs can be used as a 
marker particularly with respect to physical methods of 
detection such as electron microscopy* and/or some 
biosensors, e.g. surface plasmon resonance. 
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Diagnostic Assays 

The rgdps eg pAbs also have advantageous uses in 
diagnostic assays, particularly where separation can be 
effected using their physical properties for example 
5 centrifugation, filtration etc. 

Example 1: In vivo recombination of antibody genes 
between replicons using Cre/lox 

10 This example illustrates using the Cre/loxP system 

to transfer antibody genes between two replicons in the 
same cell. Here, recombination must occur to produce a 
functional pairing of antibody genes. 

Two constructs were made: an "acceptor" fd phage 

15 vector, fdD0G-21ox (A) and a "donor" plasmid vector, 
pUC19-21ox (B) (see Fig 4. and legend). A encodes the 
light chain of a first antibody (and the heavy chain 
from a second, different antibody): B encodes the heavy 
chain of the first antibody. In both vectors the VH 

20 genes are flanked by two loxP sites ( see Fig 4 . ) # To 
avoid deletion of the VH genes in the presence of Cre, 
one of the loxP sites is wild-type but the other 
contains a G to A point mutation within the 8 bp spacer 
region loxP 511 (Hoess, R.H. , Wierzbicki, A. and 

25 Abremski, K. (1986) et supra.). The wild-type loxP site 
and the mutant loxP 511 site do not recombine with each 
other in the same vector, but will, as shown below, 
recombine with sites of matching sequence in different 
vectors. When Cre recombinase is provided in vivo by 

30 infecting the E. coli with phage PICm cl.lOO (Rosner, 
J.L. (1972) Virology, 48, 679-689), A and B can 
co- integrate by recombination between either mutant or 
wild-type loxP sites to create chimaeric piasmids C or D 
respectively. Further recombination can then occur 
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to generate the original vectors (A and B) or two new 
vectors ( E and F ) . The heavy chains of A and B are 
therefore exchanged, and E now encodes the Fab fragment 
5 of the first antibody for display as a fusion to the 
N- terminus of the phage gene 3 protein (g3p). 

fa) Construction of fdDOG-21ox and pUC19-21ox vec tors. 

FdD0G-21ox and pUC19-21ox vectors were derived from 

10 fdDOG-l and pUC19 respectively (WO 92/01047 and WO 
92/20791; fdDOG-l previously called fdCAT-2). The 
cloning sites of these vectors were engineered using a 
combination of site-directed mutagenesis and ligation of 
double- stranded synthetic oligonucleotides using 

15 standard molecular biology techniques (Sambrook, J., 
Fritsch, E.F. and Maniatis, T. (1990) "Molecular 
cloning-a laboratory manual". Cold Spring Harbor 
Laboratory, New York. ) . 

These constructs were used to produce donor plasmid B 
20 and acceptor phage A depicted in figure 4. Plasmid B 
contains the VH gene of the anti-phOx 

(2-phenyloxazol-5-one) hybridoma NQ10.12.5 (Griffiths, 
G.M. , Berek, C. , Kaartinen, M. and Milstein, C. (1984) 
Nature , 312, 271-275. ) linked to a human Cgl segment, 

25 and cloned into pUC19-21ox as an Sfi 1-Not 1 fragment. 
Acceptor phage A contains the VL partner of the 
anti-phOx hybridoma NQ10.12.5 linked to a human Ckl 
segment cloned into fdD0G-21ox as an Apa Ll-Asc I 
fragment. Acceptor phage A also contains a VH segment 

30 from an anti-Tumour Necrosis Factor antibody (Rathjen, 
D.A. r Furphy, i.J. and Aston r R. (1992) Br. J. Cancer, 
65, 852-856.) linked to a human Cml segment, and cloned 
into fdDOG-21ox as an Sfi 1-Not 1 fragment. 

Both A and B constructs were transformed into 
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E.coli TGI, construct A conferring resistance to 
tetracyclin, construct B conferring resistance to 
ampicillin. 

5 (b) Preparation of infectious acceptor phage particles 
(construct A) . 

Phage particles were harvested from the medium of 
construct B clones grown overnight in 2x YT containing 
tetracycline, as described in PCT WO 92/01047, example 
10 6. 

(c) In vivo Cre-catalvsed recombination. 
This was performed as follows: 

1. E. coli containing the plasmid pUC19-21ox were 
15 grown, shaking at 37 °C in 2 ml 2xTY medium with 100 

mg/ml ampicillin and 1% glucose to an O.D.600nm of 0.4, 

2. 5 x 10 9 transducing units (tu) fdD0G-21ox phage were 
added (a ten-fold excess over bacteria) and incubation 

20 continued at 37 °G without shaking for 30 min. 

3. 5 x 10 9 pfu phage PICm cl.100 (confer 
chloramphenicol resistance; Rosner, J.L. (1972) et. 
supra. ) were added and incubation continued for a 
further 30 min. at 37 °C. 40 ml of this culture were 

25 then added to 2 ml 2 xTY, 100 mg/ml ampicillin, 12.5 
mg/ml tetracycline, 12.5 mg/ml chloramphenicol, 1% 
glucose. The culture was shaken for 40 hours at 30 °C. 

4. About 10 10 tu phage fd particles (including 
recombinant phage) were harvested from the culture 

30 supernatant by centrifuging out bacteria at 13000 g for 
5 min. and passing the supernatant through a 0.45 mm 
sterile filter (Minisart, Sartorius). 



In order to sample the recombined population, 10 3 tu 



WO 93/19172 



PCT/GB93/00605 



48 

of *the above fd particles were infected into fresh 
E.coli TGI and plated on 2 xTY agar containing 12.5 
mg/ml tetracycline then incubated at 37 °C overnight. 
Ninety six well seperated colonies were transferred to a 
5 96 well microtitre tray containing lOOml/well 2xTY 
containing 12.5 mg/ml tetracycline and grown at 37 °C 
overnight. This plate was used as a master stock which 
was then screened by several techniques to identify 
which recombination events had occurred: 
10 (1) EL1SA, to identify clones producing phage that bind 
to phOx-BSA ( to identify vector E ) . 

(2) Replica plating , to find clones resisitant to both 
ampicillin and tetracycline (to identify vectors C and 
D). 

15 (3) Colony hybridisation, with a radiolabelled 

oligonucleotide VHNQ10PR which binds specifically to 
CDR3 of NQ10.12.5 VH (to identify vectors C, D and E). 
(4) PCR r with oligonucleotides FDPCRBACK and VHNQ10PR 
( to identify vectors C and E ) . 

20 (5) PCR, with oligonucleotides LMB3 and VHNQ10PR (to 
identify vector D ) . 

(d) ELISA t:o identify phOX binders (vector E) 

25 1. Coat plate (Falcon 3912) with 100 pi of phOX-BSA 

(14:1 substitution) per well at 10 pg/ml, in PBS. Leave 
overnight at room temp. 

2. Rinse wells 3x with PBS, and block with 200 pi per 
well of 2% Marvel /PBS r for 2hs at 37 °C. 
30 3. Rinse wells 3x with PBS, then add 25 pi 10% 
Marvel/PBS to all wells. 

4. Add 100 pi culture supernatant to the appropriate 
wells. Mix r leave 2 hrs room temp. 

5. Wash out wells 3. times with T?BS r 0.05% Tween 20 and 
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3 times with PBS. Add 100ml sheep anti-M13 antiserum 
diluted 1:1000 in 2% Marvel/PBS into each well. 
Incubate at room temp, for 1.5 hrs. 

6. Wash out wells with 3 times with PBS, 0.05% Tween 
5 20 and 3 times with PBS. Pipette 100 \il of 1:5000 

dilution of anti- sheep IgG antibody 

(peroxidase-conjugated, Sigma). Incubate at room temp, 
for 1 . 5 hrs . 

7. Discard 2nd antibody, and wash wells 3 times with 
10 PBS, 0.05% Tween 20 and 3 times with PBS. 

8. Add one 10 mg ABTS (2,2'-azino bis 
(3-ethylbenzthiazoline -6-sulphonic acid), diammonium 
salt) tablet to 20 ml 50 mM citrate buffer, pH4.5. (50 
mM citrate buffer, pH4.5 is made by mixing equal volumes 

15 50 mM trisodium citrate and 50 mM citric acid). 

9. Add 20 ]il 30% hydrogen peroxide to the above 
solution immediately before dispensing. 

10. Add 100 ill of the above solution to each well. 
Leave room temp. 30 min. 

20 11. Quench by adding 50 pi 3.2 mg/ml sodium fluoride. 
Read at 405 nm. 

Note 1 : 'Marvel' is dried milk powder. PBS is 5.84 g 
NaCl, 4.72 g Na 2 HP0 4 and 2.64 g NaH 2 P0 4 . 2H20, pH 7.2, in 1 
25 litre. 

68 of the 96 clones were found to be positive in 
the ELISA (O.D. 405nM >1.0); 71% of the tetracycline 
resistant clones therefore correspond to vector E (fig.) 
30 since they encode functional anti-phOX Fab fragments on 
phage . 

(e) Replica plating to identify vectors C and D. 

Cells from the master plate were inoculated onto a 
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2xYT agar plate containing 100 mg/ml ampicillin, 12.5 
mg/ml tetracycline and 1% glucose, using a 96 pin 
device. The plate was incubated at 37 °C overnight. 
Five colonies had grown up the next day indicating that 
5 5/96 clones had the structures shown in C or D. 

(f ) Colony hybridisation to identify vectors C, D and E. 

Colony hybridisation was performed with the array 
using standard techniques as described in Sambrook et 
10 al. (1989, supra.). The probe used was a radiolabelled 
oligonucleotide VHNQ10PR which binds specifically to 
CDR3 Of NQ10.12.5 VH. 

73 of the 96 colonies were positive and therefore 
correspond to vectors C r D or E. 

15 

(a) PGR screening to identify vectors C and E. 

PCR reactions were performed essentially as 
described in example 11 , WO 92/01047. Cells from each of 
the 96 clones were carefully transferred using a 
20 toothpick into 20ml sterile water in a 0.5ml centrifuge 
tube. The samples were then placed in a boiling water 
bath for 5 minutes and 2ml of this used as template for 
each 20ml PCR reaction. 

Thirty cycles of amplification were performed each of 
25 94°C 1 minute, 50°C 1 minute and 72°C 2 minutes, using 
primers FDPCRBACK and VHNQ10PR. PCR reaction products 
were resolved on 1% TAE agarose gels (Sambrook et al. 
(1989 ) supra. ) . 

72 of the 96 clones clones gave a ca. 1Kb PCR fragment 
30 and were thus scored as positive. These clones 
correspond to vectors C and E. 



(q) PCR screening to identify vector D. 

A second set of PCR reactions were performed on 
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cells from the array as described above, this time using 
primers LMB3 and VHNQ10PR. 

Only 1 of the 96 clones gave a ca. 400bp PCR fragment 
and was thus scored as vector D. 

5 

(h) Analysis of recomr - ants . 

The preceding experiments show that of the 96 
tetracycline resistant clones that were sampled, 23 were 
vector A, 4 vector C, 1 vector D and 68 vector E. All 

10 68 vector E clones produced phage which bound to 
phOx-BSA, but the remaining 28 clones did not (as 
expected). Thus, 70% of all tetracycline resistant 
clones corresponded to vector E, which encodes 
functional anti-phOx Fabs for display on phage. 

15 The process is very efficient, and should allow the 
creation and use of extremely large combinatorial 
repertoires . 

Example 2. Creation of an extremely large combinatorial 
20 library using in vivo recombination. 

This example describes construction of an extremely 
large library of V-genes from unimmunised donors, using 
the in vivo recombination strategy outlined in the 
25 previous example. Many of the procedures detailed below 
have been previously described (Marks, J et al. (1991) 
et supra . ) . 

(a) Preparation of cDNA template 
30 500 ml of blood, containing approximately 10 8 

B- lymphocytes, was obtained from 2 healthy volunteers. 
The white cells were separated on Ficoll and RNA was 
prepared using a modified method (Cathala, G., J. 
. Savouret, B. Mendez, B. L. Wesr, M. Karin, J. A. Martial 
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and J. D. Baxter. (1983). A method for isolation of 
intact, transcriptionally active ribonucleic acid. DNA. 
2 , 329 . ) . Three first strand cDNA syntheses were made 
as described by Marks et al (1991, supra.) from RNA 
5 corresponding to 2.5 X 10 7 B-cells, using HulgMFOR 
constant region primer for the heavy chains, and 
HuCKFORCYS for kappa light chains and HuCLFORCYS for 
lambda light chains (Table 1) 



10 (b) PCR of heavy chains and construction of heavy chain 
repertoire. 

VH genes were PCR-amplif ied using the HulgMFOR 
primer in conjunction with each of the HuVHBACK primers 
individually. Six separate PCR amplifications were 

15 performed each of 50 pi reaction volume containing 5 pi 
of the supernatant from the cDNA synthesis using the 
HUIGMFOR primer, 20 pmol total concentration of the BACK 
primers, 20 pmol concentration of the FORWARD primer, 
250 pM dNTPs, lOmM KC1, 10 mM (NH4)2S04, 20 mM Tris.HCl 

20 (pH 8.8), 2.0 mM MgC12, 100 mg/ml BSA and 1 pi (1 unit) 
Vent DNA polymerase ( New England Biolabs ) . The reaction 
mixture was overlaid with mineral (paraffin) oil and 
subjected to 30 cycles of amplification using a Techne 
PHC-2 thermal cycler. The cycle was 94 °C for 1 minute 

25 (denaturation) , 57 °C for 1 minute (annealing) and 72 °C 

for 2.5 minutes ( extension ) . The products were purified 
on a 1.0% agarose gel, isolated from the gel by 
Geneclean (Bio-101) and resuspended in 25 plof H 2 0. The 
six products were then pooled and 1 pullthrough r PCR 

30 reactions performed to attach Sfi 1 and Not 1 
restriction sites. 

Pullthrough reactions were set up with the primers 
HUVHBACKSfi (equimolar mix of all 6 primers) and 
HUCMIFONO. 50 ml reactions of containing 5 pi of the 
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pooled PCR products from the previous step were 
amplified using the same conditions as for the primary 
PCR except that 25 cycles of amplification were used. 
The resulting fragments were digested with Sfi I and Not 
5 I, gel-purified, and the fragments ligated to Sfi I and 
Not I -cut pUC19-21ox using previously described 
procedures (Sambrook, J. et al. (1989) et supra; PCT WO 
92/01047). The ligation mixes were phenol -chloroform 
extracted prior to electroporation into TGI cells 

10 (Marks, J et al. (1991) et supra.). Briefly, the 

ligated DNA was resuspended in 20 pi of water, and 2.5 
pi samples were electroporated into 50 pi aliquots of 
electro-competent E.coli TGI. Cells were grown in SOC 
for 1 hr and then plated on 2YT agar with 100 pg/ml 

15 ampicillin and 1% glucose (2YTAG) in 243 x 243 mm dishes 
(Nunc) then grown overnight at 30 °C. Colonies were 
scraped off the plates into 2YTAG containing 15% 
glycerol for storage at -70 °C as library stocks. 

The heavy chain repertoire was calculated to have 

20 ca. 1.10 7 independent recombinants, which by Bst NI 

fingerprinting was shown to be extremely diverse (PCT WO 
92/01047). 

(c) PCR of Light chains and construction of kappa and 

25 lambda-chain repertoires. 

Kappa and lambda-chain genes were amplified 
separately. Kappa chain genes were amplified using an 
equimolar mixture of the 12 SYNKB primers in conjunction 
with HuCKFORCYS (Table 1). 1-chain genes were amplified 

30 from the cDNA synthesis using an equimolar mix of the 8 
DPVL primers in conjunction with the HUCLFORCYS primer. 
In each case 50 \xl reaction mixtures were prepared 
containing 5 \xl of the supernatant from the appropriate 
cDNA synthesis, 20 pmol total concentration of the BACK 
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primers, 20 pmol concentration of the FORWARD primers; 
250 dNTPs, lOmM KC1, 10 mM (NH4)2S04, 20 mM Tris.HCl 
(pH 8.8), 2*0 mM MgC12, 100 mg/ml BSA and 1 ]il (1 unit) 
Vent DNA polymerase (New England Biolabs). The reaction 
5 mixture was overlaid with mineral (paraffin) oil and 
subjected to 30 cycles of amplification using a Techne 
thermal cycler. The cycle was 94 °C for 1 minute 
(denaturation), 57 °C for 1 minute (annealing) and 72 °C 
for 2.5 minutes (extension). The products were purified 

10 on a 1% agarose gel, isolated from the gel by Geneclean 
(Bio- 101) and resuspended in 25 \xl of H 2 0. 

Pull through reactions were now performed on each of 
the two light chain preparations, kappa-chain genes were 
amplified using an equimolar mixture of the 12 SYNKBApa 

15 primers in conjunction with either HUCKFORCYSNOT . 

lambda-chain genes were amplified using an equimolar 
mixture of the 8 DPVLApa primers in conjunction with 
HUCLFORCYSNOT . Pullthrough conditions were performed as 
for the primary light chain PCRs above except that 25 

20 cycles of amplification were used. 

Kappa and lambda-chain repertoires were processed 
seperately. In each case, PCR products were digested 
with Apa LI and Not I and ligated into Apa LI -Not I-cut 
fdD0G-21ox (prepared using the standard format), the 

25 ligation mixes were purified by phenol extraction and 
ethanol precipitated prior to electroporation into TGI 
as above, except that transformed cells were plated on 
2YT agar with 12.5 pg/ml tetracycline in 243 x 243 mm 
dishes (Nunc) then grown overnight at 30 °C. Colonies 

30 were scraped off the plates into 2YT containing 15% 
glycerol for storage at -70 °C as library stocks. 

The kappa and lambda-chain repertoires were 
calculated to have ca. 1.10 6 independent recombinants; 
again, Bst NI fingerprinting indicates that both 
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libraries were extremely diverse. 

(d) In vivo recombination of heavy and light: chains. 
The kappa and lambda-chain repertoires were 
5 seperately recombined with the heavy chain repertoire 
using a scale- up of the procedure described in example 
1. 

0.D.600nm was used to calculate the cell density of 
the stocks scraped from the plates, using the algorithm 

10 O.D.600nm of 1.0 = 5.10 s cells. Approximately 1.10 10 
cells from each of the kappa and lambda-chain 
repertoires in f dD0G-21ox were inoculated into 1 litre 
volumes of 2xYT containing 12.5 pg/ml tetracycline and 
grown for 30hrs at 37 C with rapid shaking. Phage 

15 particles were harvested from the clarified growth 

medium as described in PCT WO 92/01047, example 6, and 
stocks adjusted to ca. 1.10 12 TU ml-1. 

1.10 11 cells from the heavy chain repertoire were 
inoculated into 2x 1 litre volumes 2YTAG in 2.5L shake 

20 flasks and grown at 37 C with rapid shaking until the 

cultures reached an O.D. 600nm of 0.4 ml' 1 . 5.10 12 fdD0G-21ox 
kappa and lambda fdD0G-21ox phage were added (a ten- fold 
excess over bacteria) and incubation continued at 37 °C 
without shaking for 30 min. 5.10 12 pfu phage PICm cl.100 

25 were then added and incubation continued for a further 
30 min. at 37 °C. The cultures were then centrifuged at 
4,000x g for 15 minutes at 4°C and the supernatant 
poured off. The cell pellets were resuspended in 1 litre 
of 2 xTY, 100 mg/ml ampicillin, 12.5 mg/ml tetracycline, 

30 12.5 mg/ml chloramphenicol, 1% glucose and the cultures 
shaken for 40 hours at 30° C. Phage fd particles 
(including recombinant phage) were harvested from the 
culture supernatant by centrifuging out bacteria at 
13000 g for lSminutes and the particles PEG 
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precipitated . 

The recombined library phage were "then resuspended 
in lOmM TRIS-HC1 (pH 8.0), ImM EDTA and adjusted "bo 
1.10 12 TU ml-1: this stock represents the library. 
5 These phage are selected on antigen, reinfected into 
fresh E.coli and recovered by plating on 2x YT agar 
containing 12.5 \ig/ml tetracycline. Growth of selected 
phages is achieved by culture in 2x YT containing 12.5 
pg/ml tetracycline (no other antibiotics necessary- see 

10 fig 4 r construct E), and phages bearing functional 
antibodies recovered from the growth medium. 

Note: Sbp members and encoding nucleic acid 
therefor obtained using the present invention may be 
used in the production of derivatives. The term 

15 derivative is discussed above. 
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TABLE 1 Oligonucleotide sequences 



ALL WRITTEN 5 r -> 3' 



A) Primers for first strand cDNA synthesis 
Human IgM Constant Region Primer 

HulgMFOR 5 f -TGG AAG AGG CAC GTT CTT TTC TTT-3 1 

Human kappa Constant Region Primer 

HUCKFORCYS 5' -ACA CTC TCC CCT GTT GAA GCT CTT-3 f 

Human lambda Constant Region Primer 

HUCLFORCYS 5 f -TGA ACA TTC TGT AGG GGC CAC TGT 

CTT -3 ' 



B) Heavy chain primary PGR 

VH Primers 



HuVHl aBACK 


5' 


-CAG 


GTG 


CAG 


CTG 


GTG 


CAG 


TCT 


GG- 


•3' 


HuVH2aBACK 


5' 


-CAG 


GTC 


AAC 


TTA 


AGG 


GAG 


TCT 


GG- 


•3' 


HuVH3aBACK 


5' 


-GAG 


GTG 


CAG 


CTG 


GTG 


GAG 


TCT 


GG- 


3' 


HuVH4aBACK 


5' 


-CAG 


GTG 


CAG 


CTG 


CAG 


GAG 


TCG 


GG- 


•3' 


HuVH5aBACK 


5' 


-GAG 


GTG 


CAG 


CTG 


TTG 


CAG 


TCT 


GC- 


•3' 


HuVH6aBACK 


5' 


-CAG 


GTA 


CAG 


CTG 


CAG 


CAG 


TCA 


GG- 


■3' 


ird Primer 






















HulgMFOR 5 ' 


-TGG 


AAG 


AGG 


CAC 


GTT 


CTT 


TTC 


TTT- 


-3' 





C) Heavy chain reamplif ication with restriction site 
primers 

VH Back Primers 

HuVHlaBACKSfi 5 ' -GTC CTC GCA ACT GCG GCC CAG CCG GCC 

ATG GCC CAG GTG CAG CTG GTG CAG 
TCT GG-3 ' 

HuVH2aBACKSf i 5 ' -GTC CTC GCA ACT GCG GCC CAG CCG GCC 

ATG GCC CAG GTC AAC TTA AGG GAG 
TCT GG-3 * 

HuVH3 aBACKS f i 5 ' -GTC CTC GCA ACT GCG GCC CAG CCG GCC 

ATG GCC GAG GTG CAG CTG GTG GAG 
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HuVH4aBACKS f i 



HuVH5 aBACKS f i 



HuVHSaBACKSf x 



Forward primer 

HCM1FONO 5 ' 



58 



TCT GG-3' 

5' -GTC CTC GCA ACT GCG GCC CAG CCG GCC 
ATG GCC CAG GTG CAG CTG CAG GAG 
TCG GG-3' 

5 ' -GTC CTC GCA ACT GCG GCC CAG CCG GCC 
ATG GCC CAG GTG CAG CTG TTG CAG 
TCT GC-3* 

5' -GTC CTC GCA ACT GCG GCC CAG CCG GCC 
ATG GCC CAG GTA CAG CTG CAG CAG 
TCA GG-3' 



-CCA CGA TTC TGC GGC CGC CAC TGG AAG 
AGG CAC GTT CTT TTC TTT 



D) Kappa chain primary PCR 

Back primers 



SYNKB1 


5' 


-GAC 


ATC 


CAG 


( A/T ) TG 


ACC 


CAG- 


3' 


SYNKB2 


5' 


-GTC 


ATC 


TGG 


ATG 


ACC 


CAG- 


-3' 




SYNKB3 


5' 


-GCC 


ATC 


CAG 


ATG 


ACC 


CAG- 


-3' 




SYNKB4 


5' 


-GAT 


(A/G)TT 


GTG 


ATG 


ACT 


CAG- 


3' 


SYNKB5 


5* 


-GA(T/G) 


ATT 


GTG 


ATG 


ACC 


CAG- 


3' 


SYNKB6 


5' 


-GAA 


ATT 


GTG 


TTG 


ACG 


CAG- 


■3' 




SYNKB7 


5' 


-GAA 


ATA 


GTG 


ATG 


ACG 


CAG- 


-3' 




SYNKB8 


5' 


-GAC 


ATC 


GTG 


ATG 


ACC 


CAG- 


-3' 




SYNKB9 


5' 


-CAG 


CAG 


GGC 


AAT 


AAG 


CAC- 


-3' • 




SYNKB10 


5' 


-CAT 


CAG 


AGT 


AGT 


AGT 


TTA 


C-3' 




SYNKB11 


5' 


-AAC 


ATC 


CAG 


ATG 


ACC 


CAG- 


-3' 




SYNKB12 


5' 


-GAA 


ATT 


GTA 


ATG 


ACA 


CAG- 


-3' 





Forward Primer 
HUCKFORCYS 



see above 



E) Kappa chain, reamplif ication with primers containing 
restriction sites 



Back primers 

SYNKBlApa 
SYNKB2Apa 
SYNKB3Apa 
SYNKB4Apa 
SYNKB5Apa 



5' 
5' 
5' 
5' 



CAT GAC CAC AGT GCA CTT GAC ATC CAG 

( A/T )TG ACC CAG-3' 
CAT GAC CAC AGT GCA CTT GTC ATC TGG ATG 

ACC CAG-3' 
CAT GAC CAC AGT GCA CTT GCC ATC CAG ATG 

ACC CAG-3' 
CAT GAC CAC AGT GCA CTT GAT (A/G)TT GTG 
ATG ACT CAG-3' 
5 '-CAT GAC CAC AGT GCA CTT GA(T/G) ATT GTG 
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ATG ACC GAG-3 ' 



SYNKB6Apa 


5" 


-CAT 


GAC 


CAC 


AGT 


GCA 


CTT 


GAA 


ATT 


GTG 


TTG 




ACG CAG-3' 
















SYNKB7Apa 


5'' 


-CAT 


GAC 


CAC 


AGT 


GCA 


CTT 


GAA 


ATA 


GTG 


ATG 




ACG CAG-3' 
















SYNKB8Apa 


5' 


-CAT 


GAC 


CAC 


AGT 


GCA 


CTT 


GAC 


ATC 


GTG 


ATG 




ACC CAG-3' 
















SYNKB9Apa 


5' 


-CAT 


GAC 


CAC 


AGT 


GCA 


CTT 


CAG 


CAG 


GGC 


AAT 






AAG CAC-3' 
















SYNKBlOApa 


5'- 


-CAT 


GAC 


CAC 


AGT 


GCA 


CTT 


CAT 


CAG 


AGT 


AGT 




AGT TTA C- 


3' 














SYNKBllApa 


5'- 


-CAT 


GAC 


CAC 


AGT 


GCA 


CTT 


AAC 


ATC 


CAG 


ATG 






ACC CAG-3' 
















SYNKB12Apa 


5'- 


-CAT 


GAC 


CAC 


AGT 


GCA 


CTT 


GAA 


ATT 


GTA 


ATG 



ACA CAG-3' 

Forward primers 

HUCKFORCYSNOT 5 * -GAG TCA TTC TCG ACT TGC GGC CGC ACA 

CTC TCC CCT GTT GAA GCT CTT-3 ' 



F) Lambda chain primary PCR 

Back primers 



DPVLla 


5*- 


-CAG 


TCT 


GTG 


( T/C )TG 


ACG 


CAG 


CCG CC-3' 


DPVLlb 


5'- 


-CAG 


TCT 


GTC 


GTG 


ACG 


CAG 


CCG 


CC-3* 


DPVLlc 


5'- 


-CAG 


TCT 


GTG 


CTG 


ACT 


CAG 


CCA 


CC-3' 


DPVL2 


5'- 


-CA(G/A) 


TCT 


GCC 


CTG 


ACT 


CAG 


CCT-3 ' 


DPVL3a 


5'- 


-TCT 


TCT 


GAG 


CTG 


ACT 


CAG 


GAC 


CC-3 ' 


DPVL3b 


5'* 


-TCC 


TAT 


GAG 


CTG 


ACT 


CAG 


CCA 


CC-3' 


DPVIi7/8 


5*- 


-CAG 


(A/G)CT 


GTG 


GTG 


AC (T/C) 


CAG GAG 


DPVL9 


5'- 


-C(A/T)G 


CCT 


GTG 


CTG 


ACT 


CAG 


CC(A/C) 



CC-3 1 

Forward primer 

HUCLFORCYS see above 



6) Lambda chain reamplif ication with primers corrtaining 
restriction sites 

Back primers 

DPVLlaApa 5 '-CAT GAC CAC AGT GCA CTT CAG TCT GTG 

( T/C )TG ACG CAG CCG CC-3' 

DPVLlbApa 5 '-CAT GAC CAC AGT GCA CTT CAG TCT GTC GTG 

ACG CAG CCG CC-3 f 
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DPVLlcApa 5 ' -CAT GAC CAC AGT GCA CTT CAG TCT GTG CTG 

ACT CAG CCA CC-3 1 
DPVL2Apa 5' -CAT GAC CAC AGT GCA CTT CA(G/A) TCT GCC 

CTG ACT CAG CCT-3 ' 
DPVL3aApa 5 1 -CAT GAC CAC AGT GCA CTT TCT TCT GAG CTG 

ACT CAG GAC CC-3' 
DPVL3bApa 5 ' -CAT GAC CAC AGT GCA CTT TCC TAT GAG CTG 

ACT CAG CCA CC-3' 
DPVL7/8Apa 5 1 -CAT GAC CAC AGT GCA CTT CAG (A/G)CT GTG 

GTG AC(T/C) CAG GAG CC-3' 
DPVL9Apa 5' -CAT GAC CAC AGT GCA CTT C(A/T)G CCT GTG 

CTG ACT CAG CC(A/C) CC-3 r 

Forward primers 

HUCLFORCYSNOT 5 ' -GAG TCA TTC TCG ACT TGC GGC CGC TGA 

ACA TTC TGT AGG GGC CAC TGT CTT- 3 1 



H) Other primers/probes 

VHNQ10PR 5' -ATA AGC CCC GTA ATC TCT TGC-3 
FDPCRBACK 5' -GCB ATG GTT GTT GTC ATT GTC GGC-3 
LMB3 5' -CAG GAA ACA GCT ATG AC- 3 
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CLAIMS 

1. A method for producing multimeric specific 
binding pair (sbp) members, which method comprises 

5 causing or allowing recombination between (a) 

first vectors comprising nucleic acid encoding a 
population of a fusion of a first polypeptide chain of a 
specific binding pair member and a component of a 
replicable genetic display package (rgdp) and (b) second 

10 vectors comprising nucleic acid encoding a population of 
a second polypeptide chain of a specific binding pair 
member, at least one of said populations being 
genetically diverse, the recombination resulting in 
recombinant vectors each of which comprises nucleic acid 

15 encoding a said polypeptide fusion and a said second 
polypeptide chain and capable of being packaged into 
rgdps using said rgdp component. 

2. A method according to claim 1 comprising 

20 expressing said polypeptide fusions and said second 
polypeptide chains, producing rgdps which display at 
their surface said first and second polypeptide chains 
and which each comprise nucleic acid encoding a said 
first polypeptide chain and a said second polypeptide 

25 chain. 

3. A method according to claim 1 or claim 2 wherein 
the recombination is intracellular and promoted by 
inclusion in the vectors of sequences at which site- 

30 specific recombination will occur • 

4. A method according to claim 1 or claim 2 wherein 
the recombination takes place in vitro and is promoted 
by inclusion in the vectors of sequences at which site- 

35 specific recombination will occur • 

5. A method according to claim 3 or claim 4 wherein 
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said resultant recombinant: vector comprises nucleic acid 
encoding a single chain sbp member resulting from 
recombination between first and second vectors. 



5 6. A method according to any one of claims 3, 4 and 

5 wherein the sequences at which site- specif ic 
recombination will occur are loxP sequences obtainable 
from coliphage PI or sequences derived from such a loxP 
sequence, and site-specific recombination is catalysed 
10 by Cre-recombinase, obtainable from coliphage PI. 

7. A method according to claim 6 wherein the Cre- 
recombinase used is expressible under the control of a 
regulatable promoter. 

15 

8. A method according to claim 3 or claim 4 wherein 
each of the first vectors and each of the second vectors 
includes a first site- specif ic recombination sequence 
and a second site- specif ic recombination sequence 

20 different from the first, site-specific recombination 
taking place between first site-specific recombination 
sequences on different vectors and between second site- 
specific recombination sequences on different vectors 
but not between a first site- specific recombination 

25 sequence and a second site-specific recombination 
sequence on the same vector. 

9. A method according to claim 8 wherein the first 
site- specific recombination sequence is loxP obtainable 

30 from coliphage PI and the second site-specific 

recombination sequence is a mutant loxP sequence. 

10. A method according to claim 8 wherein the first 
site-specific recombination sequence is a loxP . mutant 

35 and the second site-specific recombination sequence is 
loxP obtainable from coliphage PI. 
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11 • A method according to claim 9 or claim 10 wherein 
the mutant loxP sequence is loxP 511. 

12. A method according to claim 1 wherein the first 
5 vectors are phages or phagemids and the second vectors 

are plasmids, or the first vectors are plasmids and the 
second vectors are phages or phagemids . 

13. A method according to claim 12 wherein the 

10 intracellular recombination takes place in a bacterial 
host which replicates the recombinant vector 
preferentially over the first vectors and the second 
vectors . 

15 14. A method according to claim 13 wherein the 

intracellular recombination takes place in a bacterial 
host which replicates plasmids preferentially over 
phages or phagemids, or which replicates phages or 
phagemids preferentially over plasmids. 

20 

15. A method according to claim 14 wherein said 
bacterial host is a PolA strain of E.coli or of another 
gram-negative bacterium. 

25 16 A method according to any preceding claim wherein 

nucleic acid from one or more rgdp's is taken and used 
in a further method to obtain an individual sbp member 
or a mixed population of sbp members, or polypeptide 
chain components thereof, or encoding nucleic acid 

30 therefor. 

17. A method of producing multimeric specific binding 
pair (sbp) members, which method comprises: 

(i) expressing from a vector in recombinant host 
35 organism cells a population of a first polypeptide chain 
of a specific binding pair member fused to a component 
of a replicable genetic display package (rgdp) which 
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thereby displays said polypeptide chains at the surface 
of rgdps, and combining said population with a 
population of a second polypeptide chain of said 
specific binding pair member by causing or allowing 
5 first and second polypeptide chains to come together to 
form a library of said multimeric specific binding pair 
members displayed by rgdps, said population of second 
polypeptide chains not being expressed from the same 
vector as said population of first polypeptide chains, 

10 at least one of said populations being genetically 

diverse and expressed from nucleic acid that is capable 
of being packaged using said rgdp component, whereby the 
genetic material of each said rgdp encodes a polypeptide 
chain of a said genetically diverse population; 

15 (ii) selecting or screening rgdps formed by said 

expressing to provide an individual sbp member or a 
mixed population of said sbp members associated in their 
respective rgdps with nucleic acid encoding a 
polypeptide chain thereof; 

20 (iii) obtaining nucleic acid from a selected or 

screened rgdp, the nucleic acid obtained being one of 
(a) nucleic acid encoding a first polypeptide chain, (b) 
nucleic acid encoding a second polypeptide chain, and 
(c) a mixture of (a) and (b); 

25 (iv) producing a recombinant vector by causing or 

allowing recombination between (a) a vector comprising 
nucleic acid obtained in step (iii) encoding a first 
polypeptide chain and a vector comprising nucleic acid 
encoding a second polypeptide chain, or (b) a vector 

30 comprising nucleic acid encoding a first polypeptide 

chain and a vector comprising nucleic acid obtained in 
step (iii) encoding a second polypeptide chain. 

IB. A method according to claim 17 wherein the 
35 recombination takes plaqe in vitro. 



19. A method according to claim 17 wherein the 
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recombination is intracellular . 

20. A method according to claim 18 or claim 19 
wherein the intracellular recombination is promoted by 

5 inclusion in the vectors of sequences at which site- 
specific recombination will occur, 

21. A method according to claim 20 wherein the 
recombinant vector comprises nucleic acid encoding a 

10 single chain sbp member resulting from recombination 
between first and second vectors. 

22. A method according to claim 20 or 21 wherein the 
sequences at which site-specific recombination will 

15 occur are loxP sequences obtainable from coliphage PI or 
sequences derived from such a loxP sequence, and site- 
specific recombination is catalysed by Cre-recombinase, 
also obtainable from coliphage PI. 

20 23. A method according to claim 22 wherein the Cre- 
recombinase used is expressible under the control of a 
regulatable promoter. 

24. A method according to claim 17 wherein the first 
25 vectors are phages or phagemids and the second vectors 

are plasmids, or the first vectors are plasmids and the 
second vectors are phages or phagemids. 

25. A method according to claim 24 wherein the 

30 intracellular recombination takes place in a bacterial 
host which replicates the recombinant vector 
preferentially over the first vectors and the second 
vectors . 

35 26* A method according to claim 25 wherein the 

intracellular recombination takes place in a bacterial 
host which replicates plasmids preferentially over 
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phages or phagemids, or which replicates phages or 
phagemids preferentially over plasmids. 

27. A method according to claim 26 wherein said 

5 bacterial host is a PolA strain of E.coli or of another 
gram-negative bacterium. 

28. A kit for use in carrying out a method according 
to any one of claims 3 to 11, having: 

10 (i) a first vector having a restriction site 

for insertion of nucleic acid encoding or a polypeptide 
component of an sbp member, said restriction site being 
in the 5' end region of the mature coding sequence of a 
phage capsid protein , with a secretory leader sequence 

15 upstream of said site which directs a fusion of the 
capsid protein and sbp polypeptide to the periplasmic 
space of a bacterial host; and 

(ii) a second vector having a restriction site 
for insertion of nucleic acid encoding a second said 

20 polypeptide chain, 

at least one of the vectors having an origin of 
replication for single-stranded bacteriophage, the 
vectors having sequences at which site- specific 
recombination will occur. 

25 

29. Recombinant host cells harbouring a library of 
first vectors each comprising nucleic acid encoding a 
first polypeptide chain of a sbp member fused to a 
component of a secret able replicable genetic display 

30 package (rgdp) and second vectors each comprising 

nucleic acid encoding a second polypeptide chain of a 
sbp member, the first vectors or the second vectors or 
both being capable of being packaged into rgdps using 
the rgdp component, and the vectors having sequences at 

35 which site-specific recombination will occur. 

30. A population of rgdps each displaying at its 
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surface a sbp member and each containing nucleic acid 
which encodes a first and a second polypeptide chain of 
the sbp member displayed at its surface and which 
includes a site-specific recombination sequence. 

5 

31. A population of rgdps each displaying at its 
surface a sbp member and each containing nucleic acid 
which comprises a combination of (i) nucleic acid 
encoding a first polypeptide chain of a sbp member and 
10 (ii) nucleic acid encoding a second poypeptide chain of 
a sbp member, the population containing 10 10 or more 
combinations of (i) and (ii) 
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